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(57) Abstract 

Nucleic acids, including DNA constructs and RNA transcripts, capable of inducing coordinate expression of two to three cistrons 
upon direct introduction into animal tissues, are presented. Bi- or tri-cistronic polynucleotides of this invention include those encoding 
and co-expressing HIV gene products, genes encoding antigens unrelated to HIV, and immunostimulatory gene products, including but not 
limited to GM-CSF, interleukins, interixron and members of the B7 family of proteins which act as T-cell costimulatory elements. The 
methods and polynucleotides of this invention are generally applicable to co-ordinate expression in vivo of any two or more genes in a 
single cell. 
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TITLE OF THE INVENTION 

COORDINATE IN VIVO GENE EXPRESSION 

CROSS-RELATED TO OTHER APPLICATIONS 

This is a continuation-in-part of U. S. Serial Number 
08/207,526, filed March 7, 1994, now pending. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

A method for coordinate expression in a single cell, in 
vivo, of exogenous genes via introduction into the tissue of a vertebrate 
of polycistronic polynucleotide constructs is described. The method 
results in production of immune responses against the products 
produced as a result of expression of the exogenous genes. The method 
and polynucleotide constmcts of this invention may be used in a 
vertebrate to generate immune responses against antigenic epitopes 
expressed by a single cell. The coordinate expression results in 
improved expression of gene products which may be otherwise poorly 
expressed. It also results in improved cellular inunune responses due to 
provision of T-cell stimulatory signals by the same cell expressing T- 
cell antigens. Polynucleotide constmcts encoding human 
immunodeficiency vims (HIV) antigens exemplify one embodiment of 
the method. 

2. Background of the Invention 

A major challenge to the development of vaccines against 
vimses, particularly vimses with a high rate of mutation such as HIV, 
against which elicitation of neutralizing and protective immune 
responses is desirable, is the diversity of the viral envelope proteins 
among different viral isolates or strains. Because cytotoxic T- 
lymphocytes (CTLs) in both mice and humans are capable of 
recognizing epitopes derived from conserved internal viral proteins and 
may be important in the immune response against vimses, efforts have 
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been directed towards the development of CTL vaccines that ehcit 
heterologous protection against different viral strains. 

CD8+ CTLs kill viral ly-infected cells when their T cell 
receptors recognize viral peptides associated with MHC class I 
molecules. These peptides are derived from endogenously synthesized 
viral proteins. Thus, by recognition of epitopes from conserved viral 
proteins, CTLs may provide cross-strain protection. Peptides capable 
of associating with MHC class I for CTL recognition originate from 
proteins that are present in or pass through the cytoplasm or 
endoplasmic reticulum. Exogenous proteins which enter the endosomal 
processing pathway (as in the case of antigens presented by MHC class II 
molecules) are not usually effective in generating CD8"^ CTL responses. 

Efforts to generate CTL responses have used replicating 
vectors to produce the protein antigen within the cell or have introduced 
peptides into the cytosol. These approaches have limitations that may 
Umit their utiHty as vaccines. Retroviral vectors have restrictions on 
the size and stmcture of polypeptides that can be expressed as fusion 
proteins while maintaining the abihty of the recombinant virus to 
replicate. Further, the effectiveness of vectors such as vaccinia for 
subsequent immunizations may be compromised by immune responses 
against the vectors themselves. Also, viral vectors and modified 
pathogens have inherent risks that may hinder their use in humans [R.R. 
Redfield et al, New Engl. J. Med. 316, 673 (1987); L. Mascola et aL. 
Arch. Intern. Med. 149, 1569 (1989)]. Furthermore, the selection of 
peptide epitopes to be presented is dependent upon the stmcture of an 
individual's MHC antigens; thus, peptide vaccines may have limited 
effectiveness due to the diversity of MHC haplotypes in outbred 
populations. 

Benvenisty, N., and Reshef, L. [PNAS 83, 9551-9555, 
(1986)] showed that CaCl2-precipitated DNA introduced into mice 
intraperitoneal ly (i.p.). intravenously (i.v.) or intramuscularly (i.m.) 
.could be expressed. Intramuscular injection of DNA expression vectors 
in mice results in the uptake of DNA by the muscle cells and expression 
of the protein encoded by the DNA [J. A, Wolff et al.. Science 247, 
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1465 (1990); G. Ascadi et aL. Nature 352, 815 (1991)]. The plasmids 
were maintained episomally and did not replicate. Subsequently, 
persistent expression has been observed after i.m. injection in skeletal 
muscle of rats, fish and primates, and cardiac muscle of rats. The 
technique of using nucleic acids as therapeutic agents was reported in 
WO90/1 1092 (4 October 1990), in which naked polynucleotides were 
used to vaccinate vertebrates. 

It is not necessary for the success of the method that 
immunization be intramuscular. Thus, Tang et aL, [Nature, 356, 152- 
154 (1992)] disclosed that introduction of gold microprojectiles coated 
with DNA encoding bovine growth hormone (BGH) into the skin of 
mice resulted in production of anti-BGH antibodies in the mice. Furth 
et aL, [Anal. Biochem. 205, 365-368, (1992)] showed that a jet injector 
could be used to transfect skin, muscle, fat, and mammary tissues of 
living animals. Methods for introducing nucleic acids was recently 
reviewed by Friedman, T., [Science, 244, 1275-1281 (1989)]. 
Robinson et aL, [Abstracts of Papers Presented at the 1992 meeting on 
Modem Approaches to New Vaccines, Including Prevention of AIDS, 
Cold Spring Harbor, p92] reported that i.m., i.p., and i.v. 
administration of avian influenza DNA into chickens provided 
protection against lethal challenge. However, Robinson et al. did not 
disclose which avian influenza virus genes were used. In addition, only 
H7 specific immune responses were alleged; the induction of cross- 
strain protection was not discussed. Intravenous injection of a 
DNAxationic liposome complex in mice was shown by Zhu et al., 
[Science 261:209-21 1 (9 July 1993); see also WO93/24640, 9 Dec. 1993] 
to result in systemic expression of a cloned transgene. Recently, Ulmer 
et al., [Science 259:1745-1749, (1993)] reported on the heterologous 
protection against influenza virus infection by injection of DNA 
encoding influenza virus proteins. 

The need for specific therapeutic and prophylactic agents 
capable of eliciting desired immune responses against pathogens and 
tumor antigens is achieved by the instant invention. Of particular 
importance in this therapeutic approach is the ability to induce T-cell 
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immune responses which can prevent infections or disease caused by 
virus strains which are heterologous to the strain from which the 
antigen gene was obtained. This is of significance with HIV, since HIV 
mutates rapidly, and because many virulent isolates have been identified 
[see, for example, LaRosa et ah. Science 249:932-935 (1990), 
identifying 245 separate HIV isolates]. 

In response to this diversity, researchers have attempted to 
generate CTLs by peptide immunization. Thus, Takahashi et al,, 
[Science 255:333-336 (1992)] reported on the induction of broadly 
cross-reactive cytotoxic T cells recognizing an HIV envelope (gpl60) 
determinant. They recognized the difficulty in achieving a truly cross- 
reactive CTL response and suggested that there is a dichotomy between 
the priming or restimulation of T cells, which is very stringent, and the 
elicitation of effector function, including cytotoxicity, from already 
stimulated CTLs. 

Wang et al., [P.N.A.S. USA 90:4156-4160 (May, 1993)] 
reported on elicitation of immune responses in mice against HIV by 
intramuscular inoculation with a cloned, genomic (unspliced) HIV 
gene. The level of immune response achieved was low, and the system 
utilized portions of the mouse mammary tumor virus (MMTV) long 
terminal repeat (LTR) promoter and portions of the simian vims 40 
(SV40) promoter and terminator. SV40 is known to transform cells, 
possibly through integration into host cellular DNA. Therefore, unlike 
the system described herein, the system described by Wang et al. may be 
inappropriate for administration to humans. In addition, the DNA 
constmct of Wang et al. contains an essentially genomic piece of HIV 
encoding contiguous Tat/REV-gpl60-Tat/REV coding sequences 
(Figure 1). As is described in detail below, this is a suboptimal system 
for obtaining high-level expression of the gpl60. One drawback is that 
the expression of Tat has been recognized to play a contributory role in 
the progression of Kaposi^s Sarcoma, [Y.N. Vaishav and F.W. Wong- 
Staal, An. Rev. Biochem. (1991)]. 

WO 93/17706 describes a method for vaccinating an animal 
against a virus, wherein carrier particles were coated with a gene 
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construct and the coated particles are accelerated into cells of an animal. 
In regard to HIV, essentially the entire genome, minus the long terminal 
repeats, was proposed to be used. That method may represent a 
substantial risk for recipients. Constructs of HIV should, in general, 
contain less than about 50% of the HIV genome to ensure safety of the 
vaccine. Thus, a number of problems remain if a useful human HIV 
vaccine is to emerge from the gene-delivery technology. 

The instant invention uses known methods for introducing 
polynucleotides into Hving tissue to induce expression of proteins. This 
invention provides a immunogen for introducing HIV and other 
proteins into the antigen processing pathway to efficiently generate 
HIV-specific CTLs and antibodies. The pharmaceutical is effective as a 
vaccine to induce both cellular and humoral anti-HIV and HIV 
neutralizing immune responses. The instant invention addresses some of 
the problems by providing polynucleotide immunogens which, when 
introduced into an animal, direct the efficient expression of HIV 
proteins and epitopes without the attendant risks associated with those 
methods. The immune responses generated are effective at recognizing 
HIV, at inhibiting replication of HIV, at identifying and killing cells 
infected with HIV, and are cross-reactive against many HIV strains. 
Therefore, this invention provides a useful immunogen against HIV. 
The invention also provides polynucleotide constructs which enable the 
co-expression, in vivo , of more than one gene-product in a single cell. 
This is demonstrated with an HIV gene expression system in which the 
expression of a first gene is dependent on the co-expression in the same 
cell of a second gene product. By virtue of the success of achieving this 
co-expression in vivo, it is now predictable that this type of 
polynucleotide construct may be appHed to co-expression in vivo of 
many combinations of gene products, including but not limited to viral 
antigens other than HIV related antigens, carcinoma-associated antigens, 
and immunomodulatory or immunostimulatory gene products. 
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SUMMARY OF THE INVENTION 

Nucleic acids, including DNA constructs and RNA 
transcripts, capable of inducing coordinate expression of two to three 
cistrons upon direct introduction into animal tissues, are presented. In 
one embodiment, coordinate expression of two cistrons encoding HIV 
proteins and elicitation of HIV specific immune responses against more 
than one gene products is demonstrated. Cytotoxic T lymphocytes 
(CTLs) specific for viral antigens which respond to different strains of 
human immunodeficiency vims (HIV), and antibodies which are 
generally strain-specific are generated. The generation of such CTLs in 
vivo usually requires endogenous expression of the antigen, as in the 
case of virus infection. To generate a viral antigen for presentation to 
the immune system, without the limitations of direct peptide delivery or 
the use of viral vectors, polynucleotides encoding HIV proteins are 
directly introduced into tissues of vertebrates in vivo , the 
polynucleotides are taken up by cells within the tissue, and the encoded 
proteins produced and processed for presentation to the immune system. 
In mice, this resulted in the generation of HIV-specific CTLs and 
antibodies. Similar results are achieved in primates. These results are 
achieved with bi- or tri-cistronic nucleic acid polynucleotides encoding 
and co-expressing HIV gene products, immunostimulatory gene 
products including but not limited to GM-CSF, interleukins, interferon 
and B7 proteins, which act as T-cell costimulatory elements. The 
methods and polynucleotides of this invention are generally appHcable 
to co-ordinate expression in vivo of any two or three genes. Thus, 
various embodiments of this invention include coordinate expression m 
vivo of viral antigens and immunostimulatory gene products as well as 
coordinate expression of tumor antigens and immunostimulatory genes. 

BRIEF DESCRIPTION OF THE PR AWTNC.S 
Fig. 1 . A schematic representation of the HIV genome. 
Fig. 2. A schematic representation of a polynucleotide constmct of 
this invention capable of inducing the co-ordinate expression 
in vivo in a single cell of up to three gene products encoded 
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by each of three cistrons (I, II, and III). The segments A and 
B represent control sequences including transcription 
termination signals and promoters or internal ribosome entry 
sites (IRES). 

Fig. 3. Detailed schematic of an HIV env polynucleotide immunogen 
construct comprising the CMV-intA transcription promoter, a 
5'-spHce donor, HIV gpl60 (showing gpl20, gp41, and the 
REV-responsive element, RRE), an intemal ribosome entry 
site (IRES), the REV cistron, the BGH transcription 
terminator, and the neomycin resistance marker which is 
driven by a prokaryotic transcription promoter.. 
Fig. 4. Detailed schematic of dicistronic HIV env and gag 

polynucleotide immunogen constructs showing specific 
regulatory elements. 
Fig. 5. Western blot analysis of gpl60 expression induced by HIV 

polynucleotide immunogens. This result rigorously shows the 
coexpression in a single cell of more than one gene product 
from a single polynucleotide construct: A polynucleotide 
encoding gpl60 alone (see panel B, fourth lane from the left) 
expresses no detectable gpl60, but with REV added in trans 
(by cotransfection of a construct encoding only REV), there is 
good gpl60 expression (panel A, fourth lane from the left). 
A genomic tat/REV/env construct expresses only low levels of 
gpl60, whether or not REV is provided in trans (panels A 
and B, third lane). However, a dicistronic gpl60/IRES/REV 
construct heavily expresses gpl60 (panels A and B, fifth lane 
from the left). The best expression, is obtained in a 
dicistronic construct encoding gpl60/IRES/REV, with a splice 
donor (SD) provided 5' to the gpl60 coding sequence (panels 
A and B, right hand lane). Because no additional expression 
is achieved when additional REV is provided in trans (panel A 
right hand lane), the system is not limited by the level of REV 
being expressed. 
Fig. 6. VIJ Sequence. 
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VlJneo Sequence. 
CMVintABGH Sequence. 

Cytotoxic T lymphocytes generated in rhesus monkeys in 
response to VlJ-SIV-p28 polynucleotide construct vaccination 
(REV independent). This SIV p28 is equivalent to p24 gag of 
HIV. Thus, CTLs specific to a group specific antigen are 
inducible using a gag encoding polynucleotide construct. 
Cytotoxic T lymphocytes generated in response to Vaccinia- 
SIVp28 nucleic acid vaccination. This demonstrates that 
similar CTLs are induced by a gag encoding polynucleotide 
(figure 9) as compared with a replicating antigen (vaccinia) 
expressing the same antigen [see Shen, L., et al., Science 
252:440-443, 1991]. 
Fig. 1 1 . Sequence of the Vector V 1 R. 

Fig. 12. Antibodies induced by VlJns-tPA-gpl20, 200 ^ig/mouse per 
round, 2 rounds. 

Fig. 13. Neutralization of HIV-1 (MN) virus by sera from VlJns- 
tPA-gpl20 (MN) DNA vaccinated African Green Monkeys. Panels a 
and B show the reduction in p24 gag protein production for C8166 cells 
infected with HIV-1 (MN) following exposure to the indicated dilutions 
of sera from VlJns-tPA-gpl20 DNA vaccinated monkeys. Data was 
obtained after 10 days in tissue culture following virus inoculation 
(TCID50 per sample). 

Fig. 1 4 T cells from V 1 Jns-tPA-gp 1 20 vaccinated mice exhibiting 
long-term, antigen-specific T lymphocyte memory responses. 
Immunized mice received 1 .6 meg of vaccine DNA twice, six months 
prior to sacrifice. Splenic T cells were cultured in vitro with 
recombinant gpl20 protein at 5 mcg/mL. Proliferation of gpl20- 
specific T cells. A stimulation index (SI; incorporated ^H-thymidine 
for gpl20 treated. T cells :T cells that did not receive antigen). 
Fig. 15. Type 1 T helper (ThI) lymphocyte cytokine secretion by T 
cells from V 1 Jns-tPA-gp 1 20 DNA vaccinated mice. Cell culture 
supematants from the samples shown in Figure 13 were assayed from 
gamma-interferon and interleukin 4 (IL-4) secretion following 



Fig. 7. 
Fig. 8. 
Fig. 9. 



Fig. 10. 



wo 95/24485 



PCT/US95/02633 



- 9 - 



treatment with rgpl20. Immune mice secreted large amounts of 
gamma-interferon and very low amounts of IL-4 indicated that ThI- 
like responses were induced by this vaccine. Control mice showed very 
low amounts of interferon secretion while the IL-4 levels indicated are 
background levels. 

Fig. 15 Anti-gpl20 cytotoxic T lymphocyte (CTL) activities in 
VlJns-tPA-gpl20 DNA vaccinated mice. Two mice (2006 and 2008) 
showed MHC I restricted CTL activities specific to a gpl20 peptide 
(PI 8) following gpl20 DNA vaccinations. No activities were observed 
for these mice in the absence of PI 8 or by a control mouse which had 
not been previously vaccinated. 

Fig. 16. Anti-gpl60 CTL activities by rhesus monkeys vaccinated 
with VlJns-gpl60/IRES/rev and VlJns-tPA-gpl20 DNA vaccines. T 
lymphocyte cultures from all four monkeys receiving these vaccines 
showed MHC I restricted killing of autologous target cells that had been 
treated with vaccinia-gpl60. No CTL activity was observed in four 
control rhesus that had been immunized with 'blank' DNA vaccine 
(VlJns without a gene insert). 

DETAILED DESCRIPTION OF THE INVENTION 

Nucleic acids, including DNA constructs and RNA 
transcripts, capable of inducing coordinate expression of two to three 
cistrons upon direct introduction into animal tissues, are presented. In 
one embodiment, coordinate expression of two cistrons encoding HIV 
proteins and elicitation of HIV specific immune responses against more 
than one gene products is demonstrated. Cytotoxic T lymphocytes 
(CTLs) specific for viral antigens which respond to different strains of 
human immunodeficiency vims (HIV), and antibodies which are 
generally strain-specific are generated. The generation of such CTLs in 
vivo usually requires endogenous expression of the antigen, as in the 
case of vims infection. To generate a viral antigen for presentation to 
the immune system, without the limitations of direct peptide delivery or 
the use of viral vectors, polynucleotides encoding HIV proteins are 
directly introduced into tissues of vertebrates in vivo, the 
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polynucleotides are taken up by cells within the tissue, and the encoded 
proteins produced and processed for presentation to the immune system. 
In mice, this resulted in the generation of HIV-specific CTLs and 
antibodies. Similar results are achieved in primates. These results are 
achieved with bi- or tri-cistronic nucleic acid polynucleotides encoding 
and co-expressing HIV gene products, immunostimulatory gene 
products including but not limited to GM-CSF, interleukins, interferon 
and B7 proteins, which act as T-cell costimulatory elements. The 
methods and polynucleotides of this invention are generally applicable 
to co-ordinate expression in vivo of any two or three genes. Thus, 
various embodiments of this invention include coordinate expression in 
vivo of viral antigens and immunostimulatory gene products as well as 
coordinate expression of tumor antigens and immunostimulatory genes. 

This invention provides polynucleotides which, when 
directly introduced into a vertebrate in vivo , including mammals such as 
primates and humans, induces the expression of encoded proteins within 
the animal. 

As used herein, a polynucleotide is a nucleic acid which 
contains essential regulatory elements such that upon introduction into a 
living vertebrate cell, is able to direct the cellular machinery to produce 
translation products encoded by the genes comprising the 
polynucleotide. 

In one embodiment of the invention, the polynucleotide is a 
polydeoxyribonucieic acid comprising HIV genes operatively linked to a 
transcriptional promoter. In another embodiment of the invention, the 
polynucleotide vaccine comprises polyribonucleic acid encoding HIV 
genes which are amenable to translation by the eukaryotic cellular 
machinery (ribosomes, tRNAs, and other translation factors). Where 
the protein encoded by the polynucleotide is one which does not 
normally occur in that animal except in pathological conditions, (i.e. an 
heterologous protein) such as proteins associated with human 
immunodeficiency virus, (HIV), the etiologic agent of acquired immune 
deficiency syndrome, (AIDS), the animals' immune system is activated 
to launch a protective immune response. Because these exogenous 
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proteins are produced by the animals' own tissues, the expressed 
proteins are processed by the major histocompatibihty system, MHC, in 
a fashion analogous to when an actual infection with the related 
organism, HIV, occurs. The result, as shown in this disclosure, is 
induction of immune responses against the cognate pathogen. 

Accordingly, the instant inventors have prepared nucleic 
acids which, when introduced into the biological system induce the 
expression of HIV proteins and epitopes. The induced antibody 
response is both specific for the expressed HIV protein, and neutralizes 
HIV. hi addition, cytotoxic T-lymphocytes which specifically recognize 
and destroy HIV infected cells are induced. The instant inventors have 
also developed polynucleotides whereby simian immunodeficiency virus 
(SIV) genes are efficiently expressed upon introduction in vivo . This 
achievement is significant because the only animal model closely 
mimicking the human disease, ADDS, is the subhuman primate model 
utilizing SIV. Thus, efficacy of the instant immunogens as vaccines can 
be shown by analogy to the effects obtained in vivo utilizing HIV and 
SIV polynucleotide immunogens. 

There are many embodiments of the instant invention 
which those skilled in the art can appreciate from the specifics taught 
herein. Thus, different transcriptional promoters, terminators, carrier 
vectors or specific gene sequences may be used successfully based on the 
successful invention disclosed herein. 

The instant invention provides a method for using a 
polynucleotide which, upon introduction into mammalian tissue, induces 
the co-expression in a single cell, in vivo , of two or more different, 
discrete gene products. The method is exemplified by using an HIV 
model which demonstrates the co-expression of more than one gene 
product in a single cell upon introduction of the polynucleotide into 
mammalian tissue in vivo . The model is stringent because certain HIV 
genes contain a sequence known as the REV responsive element (RRE). 
These genes are not efficiently expressed unless another HIV gene, 
known as REV, is also present within the cell expressing the RRE- 
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containing HIV gene. This phenomenon is described as REV 
dependence. 

Pavlakis and Felber, WO 93/20212 have described a 
method of ehminating sequences which may induce transcript 
instabihty, which may also achieve some REV independence of certain 
HIV genes. That method may not be generally applicable to all such 
genes, is time-consuming and may require multiple gene modifications. 
Furthermore, the level of expression and immunogenicity of such genes 
may be compromised by elimination of the REV dependence. 

The instant invention provides a different solution which 
does not require multiple manipulations of REV dependent HIV genes to 
obtain REV -independence. In addition, the instant invention is 
applicable to expression of REV independent genes as well as to 
expression of REV dependent genes. The REV-dependent expression 
system described herein, is useful in its own right and is also useful as a 
stringent system for demonstrating the co-expression in a single cell in 
vivo of more than a single desired gene-product. Thus, in any 
circumstance in which it is beneficial to achieve the co-expression, 
within a given cell in vivo , of more than a single gene product, the 
methods and polynucleotide constructs described herein may be 
employed. 

One situation, exemplified herein, is the co-expression of 
an immunogenic epitope and a member of the family of T-cell 
recognition elements known as B7. Recently, Steven Edgington 
[ Biotechnology 11:11 17-1119, 1993] reviewed the coordinate roles of 
B7 and the major histocompatibility complex (MHC) presentation of 
epitopes on the surface of antigen presenting cells in activating CD8+ 
CTLs for the elimination of tumors. Once a MHC molecule on the 
surface of an antigen presenting cell (APC) presents an epitope to a T- 
cell receptor (TCR), B7 expressed on the surface of the same APC acts 
as a second signal by binding to CTLA-4 or CD28. The result is rapid 
division of CD4+ helper T-cells which signal CD8+ T-cells to 
proliferate and kill the APC. Thus, our demonstration herein of 
efficient expression and production of immune responses against an HIV 
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REV dependent gene containing an RRE by coordinately expressing a 
gene for REV, conclusively proves that more than one gene can be co- 
ordinately expressed by introducing a polynucleotide encoding two and 
even three cistrons (defined as a stretch of nucleic acid that carries the 
information for a polypepude chain). 

Because many of the apphcations of the instant invention 
apply to anti-viral vaccination, the polynucleotides are frequently 
referred to as a polynucleotide vaccine (PNV). This is not to say that 
additional utilities of these polynucleotides, in immune stimulation and 
in anti-tumor therapeutics, is to be ignored or considered to be outside 
the scope of the invention. 

In one embodiment of this invention, a gene encoding an 
HIV gene product is incorporated in an expression vector. The vector 
contains a transcriptional promoter recognized by an eukaryotic RNA 
polymerase, and a transcriptional terminator at the end of the HIV gene 
coding sequence. In a preferred embodiment, the promoter is the 
cytomegalovirus promoter with the intron A sequence (CMV-intA), 
although those skilled in the art will recognize that any of a number of 
other known promoters such as the strong immunoglobulin, or other 
eukaryotic gene promoters may be used. A preferred transcriptional 
terminator is the bovine growth hormone terminator. The combination 
of CMVintA-BGH terminator (Fig. 8, SEQ. ID: 13:) is particularly 
preferred. In addition, to assist in preparation of the polynucleotides in 
prokaryotic cells, an antibiotic resistance marker is also preferably 
included in the expression vector under transcriptional control of a 
prokaryotic promoter so that expression of the antibiotic does not occur 
in eukaryotic cells. Ampicillin resistance genes, neomycin resistance 
genes or any other pharmaceutically acceptable antibiotic resistance 
marker may be used. In a preferred embodiment of this invention, the 
antibiotic resistance gene encodes a gene product for neomycin 
resistance. Further, to aid in the high level production of the 
polynucleotide by fermentation in prokaryotic organisms, it is 
advantageous for the vector to contain a prokaryotic origin of 
replication and be of high copy number. Any of a number of 
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commercially available prokaryotic cloning vectors provide these 
benefits. In a preferred embodiment of this invention, these 
functionalities are provided by the commercially available vectors 
knov^n as pUC, It is desirable, however, to remove non-essential DNA 
sequences. Thus, the lacZ and lad coding sequences of pUC are 
removed in one embodiment of the invention. It is also desirable that 
the vectors not be able to replicate in eukaryotic cells. This minimizes 
the risk of integration of polynucleotide vaccine sequences into the 
recipients' genome. 

In another embodiment, the expression vector pnRSV is 
used, wherein the Rous Sarcoma virus (RSV) long terminal repeat 
(LTR) is used as the promoter. In yet another embodiment, VI, a 
mutated pBR322 vector into which the CMV promoter and the BGH 
transcriptional terminator were cloned is used. In a particularly 
preferred embodiment of this invention, the elements of VI and pUC19 
have been combined to produce an expression vector named VI J (SEQ. 
ID: 12:). Into VIJ or another desirable expression vector is cloned an 
HIV gene, such as gpl20, gp41, gpl60, gag, pol, env, or any other HIV 
gene which can induce anti-HIV immune responses (antibody and/or 
CTLs). Exclusion of functional reverse transcriptase and integrase 
functions encoded by the HIV genome is desirable to minimize the risk 
of integration of the polynucleotide vaccine encoded sequences into the 
recipients' genome. In another embodiment, the ampicillin resistance 
gene is removed from VIJ and replaced with a neomycin resistance 
gene, to generate VlJ-neo (SE0.ID:14:), into which any of a number of 
different HIV genes have been cloned for use according to this 
invention. In yet another embodiment, the vector is VlJns, which is the 
same as VlJneo except that a unique Sfii restriction site has been 
engineered into the single Kpnl site at position 2114 of VlJ-neo. The 
incidence of Sfil sites in human genomic DNA is very low 
(approximately 1 site per 100,000 bases). Thus, this vector allows 
careful monitoring for expression vector integration into host DNA, 
simply by Sf i 1 digestion of extracted genomic DNA. In a further 
refinement, the vector is VIR. In this vector, as much non-essential 
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DNA as possible was "trimmed" from the vector to produce a highly 
compact vector. This vector is a derivative of VlJns and is shown in 
Figure 1 1 , (SEQ.ID.rlOO:). This vector allows larger inserts to be used, 
with less concem that undesirable sequences are encoded and optimizes 
uptake by cells when the construct encoding specific influenza virus 
genes is introduced into surrounding tissue. In figure 1 1, the portions 
of VlJneo (Figure 7) that are deleted are shown as a gap, and inserted 
sequence is in bold text, but the numbering of VlJneo is unchanged. 
The foregoing vector modification and development procedures may be 
accomplished according to methods known by those skilled in the art. 
The particular products described however, though obtained by 
conventional means, are especially useful for the particular purpose to 
which they are adapted. 

One embodiment of this invention incorporates genes 
encoding HIV gpl60, gpl20, gag and other gene products from such 
well known laboratory adapted strains of HIV as SF2, IIIB or MN, for 
which a great deal of data has been generated, for example, such as 
showing that chimpanzees can be protected from a lethal challenge of 
HIV inB vims by first administering HIV Illb V3 loop specific 
monoclonal antibody [Emini et ah. Nature 355 : 728-730 1992], or by 
vaccination with recombinant gpl20 but not gpl60 [Herman et al.. 
Nature 345 : 822-825, 1990]. Those skilled in the art will recognize 
that the use of genes from HIV-2 strains having analogous function to 
the genes from HIV-l would be expected to generate immune responses 
analogous to those described herein for HIV-l constmcts. The cloning 
and manipulation methods for obtaining these genes are well known to 
those skilled in the art. 

There has recently been recognition that elicitation of 
immune responses against laboratory adapted strains of HIV may not be 
adequate to provide neutralization of primary, field isolates of HIV, [see 
for example Cohen, J., Science 262 : 980-981, 1993]. Thus, in another 
embodiment of this invention, genes from vimlent, primary field 
isolates of HIV are incorporated in the polynucleotide immunogen. ' 
This is accomplished by preparing cDNA copies of the viral genes and 
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then subcloning the individual genes into the polynucleotide 
immunogen. Sequences for many genes of many HIV strains are now 
publicly available on GENBANK and such primary, filed isolates of 
HIV are available from the National Institute of Allergy and Infectious 
Diseases (NIAID) which has contracted with Quality Biological, Inc., 
[7581 Lindbergh Drive, Gaithersburg, Maryland 20879] to make these 
strains available. Such strains are also now available from the World 
Health Organization (WHO) [Network for HIV Isolation and 
Characterization, Vaccine Development Unit, Office of Research, 
Global Program on AIDS, CH-121 1 Geneva 27, Switzerland]. From 
this work those skilled in the art will recognize that one of the utilities 
of the instant invention is to provide a system for in vivo as well as in 
vitro testing and analysis so that a correlation of HIV sequence diversity 
with serology of HIV neutralization, as well as other parameters can be 
made. The isolation and cloning of these various genes may be 
accomplished according to methods known to those skilled in the art. 
Thus this invention further provides a method for systematic 
identification of HIV strains and sequences for vaccine production. 
Incorporation of genes from primary isolates of HIV strains provides an 
immunogen which induces immune responses against clinical isolates of 
the virus and thus meets a need as yet unmet in the field. Furthermore, 
as the virulent isolates change, the immunogen may be modified to 
reflect new sequences as necessary. 

To keep the terminology consistent, the following 
convention is followed herein for describing polynucleotide immunogen 
constructs: 

"Vector name-HIV strain-gene-additional elements". Thus, 
a construct wherein the gpl60 gene of the MN strain is cloned into the 
expression vector VlJneo, the name it is given herein is: "VlJneo-MN- 
gpl60". The additional elements that are added to the construct are 
described in further detail below. Naturally, as the etiologic strain of 
the virus changes, the precise gene which is optimal for incoiporation in 
the pharmaceutical may be changed. However, as is demonstrated 
below, because cytotoxic lymphocyte responses are induced which are 
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capable of protecting against heterologous strains, the strain variabiHty 
is less critical in the immunogen and vaccines of this invention, as 
compared with the whole virus or subur polypeptide based vaccines. 
In addition, because the pharmaceutical is easily manipulated to insert a 
new gene, this is an adjustment which is easily made by the standard 
techniques of molecular biology. 

To provide a complete description of the instant invention, 
the following background on HIV is provided. The human 
immunodeficiency virus has a ribonucleic acid (RNA) genome, the 
structure of which is represented in Figure 1. This RNA genome must 
be reverse transcribed according to methods known in the art in order 
to produce a cDNA copy for cloning and manipulation according to the 
methods taught herein. At each end of the genome is a long terminal 
repeat which acts as a promoter. Between these termini, the genome 
encodes, in various reading frames, gag-pol-env as the major gene 
products: gag is the group specific antigen; pol is the reverse 
transcriptase, or polymerase; also encoded by this region, in an altemate 
reading frame, is the viral protease which is responsible for post- 
translational processing, for example, of gpl60 into gpl20 and gp41; 
env is the envelope protein; vif is the virion infectivity factor; REV is 
the regulator of virion protein expression; neg is the negative 
regulatory factor; vpu is the virion productivity factor "u"; tat is the 
trans-activator of transcription; vpr is the viral protein r. The function 
of each of these elements has been described (see AIDS 89, A Practical 
Synopsis of the V International Conference, June 4-9, 1989, Montreal, 
A Philadelphia Sciences Group Publication, from which figure 1 was 
adapted). 

In one embodiment of this invention, a gene encoding an 
HIV or SIV protein is directly linked to a transcriptional promoter. 
The env gene encodes a large, membrane bound protein, gpl60, which 
is post-translationally modified to gp41 and gpl20. The gpl20 gene 
may be placed under the control of the cytomegalovirus promoter for 
expression. However, gpl20 is not membrane bound and therefore, 
upon expression, it may be secreted from the cell. As HIV tends to 
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remain dormant in infected cells, it is desirable that immune responses 
directed at cell-bound HIV epitopes also be generated. This goal is 
accomplished herein by expression in vivo of the cell-membrane 
associated epitope, gpl60, to prime the immune system. However, 
expression of gpl60 is repressed in the absence of REV due to non- 
export from the nucleus of non-spliced genes. For an understanding of 
this system, the life cycle of HIV must be described in further detail. 

In the life cycle of HIV, upon infection of a host cell, HIV 
RNA genome is reverse-transcribed into a proviral DNA which 
integrates into host genomic DNA as a single transcriptional unit. The 
LTR provides the promoter which transcribes HIV genes from the 5' to 
3' direction (gag, pol, env), to form an unspliced transcript of the entire 
genome. The unspliced transcript functions as the mRNA from which 
gag and pol are translated, while limited splicing must occur for 
translation of env encoded genes. For the regulatory gene product REV 
to be expressed, more than one spHcing event must occur because in the 
genomic setting, REV and env, as is shown in figure 1 , overlap. In 
order for transcription of env to occur, REV transcription must stop, 
and vice versa. In addition, the presence of REV is required for export 
of unspliced RNA from the nucleus. For REV to function in this 
manner, however, a REV responsive element (RRE) must be present on 
the transcript [Malim et al., Namre 338:254-257 (1989)]. 

In the polynucleotide vaccine of this invention, the 
obhgatory splicing of certain HIV genes is eliminated by providing fully 
spliced genes (i.e.: the provision of a complete open reading frame for 
the desired gene product without the need for switches in the reading 
frame or elimination of noncoding regions; those of ordinary skill in 
the art would recognize that when splicing a particular gene, there is 
some latitude in the precise sequence that results; however so long as a 
functional coding. sequence is obtained, this is acceptable). Thus, in one 
embodiment, the entire coding sequence for gpl60 is spliced, and the 
sequence of REV is spHced, such that no intermittent expression of each 
gene product is required. Furthermore, the features of REV regulated 
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expression are exploited to optimize expression of HIV encoded REV- 
dependent, immunogenic gene products. 

For REV to function as an exporter of transcripts from the 
nucleus to be translated in the cytoplasm, REV requires, in addition to 
the presence of a REV responsive element (RRE) on the transcript to be 
exported, at least one splice donor site on the 5' side of the gene 
containing the RRE [Lu et al., P.N.A.S. USA 87:7598-7602, (October 
1990); Chang and Sharp, Cell 59:789-795 (December 1, 1989)]. The 
instant inventors conceived polynucleotides providing the REV coding 
sequence in a location on the same expression vector as the gene to be 
expressed such that co-expression of REV and the REV responsive gene 
occur without the need for any splicing. Thus, in a preferred 
embodiment of this invention, HIV genes are placed immediately 
downstream from a transcriptional promoter, such as the CMV 
promoter, and the spliced REV coding sequence is placed at a location 
3' to (also referred to as downstream from) the first coding sequence. 
Naturally, the order of these genes could be changed. However, it may 
be preferable to have the immunogenic HIV cistron abut directly to the 
transcriptional promoter to ensure that all transcripts produced encode 
the entire cistron. 

One method for achieving co-expression of genes relies on 
co-transfection of cells in culture with different vectors expressing 
different genes. For a REV dependent gene, the REV gene product 
could be provided in this manner in trans. However, this is suboptimal 
for the purposes of this invention, although not outside the scope of the 
instant invention, because of the low probability that co-transfection of a 
given cell would occur in vivo so as to achieve the necessary availability 
of REV for vigorous expression of REV dependent immunogenic HIV 
gene products. Another method is to provide several promoters on a 
given vector, each promoter controlling expression of a separate gene. 
This amounts to providing REV gene product in cis. This solution may 
be employed according to the instant invention. In such an embodiment, 
it would be preferable for the various promoters and the genes they 
control to run in opposite directions. However, because of the known 
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competitive interference between promoters in this type of multiple 
gene vector, this embodiment is also considered sub-optimal. 

Ghattas et ah, [Moh and Celh Bioh H, No. 12:5848-5859 
(Dec. 1991)], Kaufaman et ah [Nuc. Acids Res. 19, No. 16:4485-4490 
(1991)], and Davies [J. Viroh 66, No. 4:1924-1932 (Apr. 1992)] have 
described an internal ribosome entry site (IRES) in the 
encephalomyocarditis virus (EMCV) leader. They reported that a 
system in which an upstream promoter could be used to initiate 
transcription of a dicistronic mRNA provides good expression of both 
the 5' and 3' open reading frames when an IRES is located between the 
two genes. Chen et al. (J. Viral., 67 : 2142-2145, 1993] have reported a 
system in which the 5 nontranslated region (NTR) from swine vesiculor 
disease vims (SVDV) was used to constmct a bicistronic vims for the 
coexpression of two genes from one transcript from an infectious viral 
vector. 

The instant inventors have discovered that a nucleic acid 
constmct which incorporates coordinated expression of an HIV gene 
containing a REV responsive element (RRE), an internal ribosome entry 
site (IRES) and a REV coding sequence results in efficient expression of 
both REV and the REV dependent gene product. This embodiment of 
the invention is better understood with reference to figures 2 and 3. 
Fig. 2 shows a generalized embodiment while. Figure 3, shows a 
specific embodiment of this invention which, according to the 
nomenclature system described above, is VlJns-gpl60(RRE)-IRES- 
REV. The strain of HIV from which the immunogenic HIV gene is 
derived is irrelevant for the illustrative purposes of this discussion, and 
indeed, the expression of any REV dependent gene product is 
predictably efficient, as is the elicitation of immune responses against 
both REV and the REV dependent gene product, based on the instant 
patent disclosure. According to the embodiment shown in Fig. 3, the 
vector is VI Jns, described above. Thus, the promoter (CMVintA) and 
terminator (BGH) are provided for by the vector, along with a 
prokaryotic origin of replication, to faciUtate large scale production of 
the HIV polynucleotide vaccine through fermentation of bacteria 
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transformed with the construct, according to methods well known in the 
art. This construct does not replicate in eukaryotic tissue, due to the 
absence of an eukaryotic origin of replication. A splice donor site from 
the naturally occurring rev/tat splice donor is provided (rev/tat SD) 
immediately preceding the HIV gene. The gag/pol/env coding sequence 
contains or is followed by a REV responsive element (RRE) which, 
upon formation of the nascent transcript, provides the necessary signals 
for REV binding to and export of the REV dependent mRNA from the 
nucleus. Next, there are sequences provided for reinitiation of 
translation at the internal ribosome entry site (IRES) so that the 
downstream REV coding sequence is efficiently translated. In this 
manner, REV gene product is provided in cis, on the same 
polynucleotide as a REV dependent gene product. 

In further refinements to the instant invention, a third 
cistron may be included in the PNV. The genes encoding such 
immunostimulatory proteins as the B7-antigen presenting cell-surface 
protein, the human granulocyte/monocyte colony stimulatory factor 
(GM-CSF) gene, and cytokine genes such as interleukin and interferon, 
the use of tissue-specific transcriptional promoters and enhancers, are 
all contemplated. The provision of B7 or GM-CSF gene in cis, either 
by insertion of an IRES after REV and before the B7 gene, by provision 
of a second promoter on the same vector construct as the dicistronic 
REV-dependent HIV gene, IRES-REV construct, or in trans using a 
separate construct are all envisioned by extension of the foregoing 
teachings regarding REV and REV dependent genes. The generahzed 
immuno-stimulatory effect of these gene products may be sufficient 
even if provided in trans to enhance immune responses against the HIV 
gene products encoded by the immunogen of this invention. It is 
preferable, particularly for B7, that the same cell presenting HIV 
epitopes in the cleft of MHC-I molecules also present B7. This co- 
presentation of both the antigenic epitope and B7 "closes" the switch 
necessary for T-cell activation. Cytokines, particularly IL-12, which 
modifies whether a predominant humoral or cellular immune response 
is mounted [see Afonso et al.. Science 261:235-237, 1994], either is 
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provided intravenously at the same time that PNV is introduced, or is 
included as a third cistron in the PNV, thereby assuring locaUzed 
production of the interleukin. The genes for these immunostimulatory 
and immunoregulatory proteins, including GM-CSF (see Shaw and 
Kamen, CeU 46:659-667, 1986 ), interleukin- 1 2 (see Wolf, S., et al., L 
Immunol. 146 :3074-3081. 1991) and B7, (see Gordon et ah, L 
Immunol. 143 :2714-2722, 1989; for clones and sequences of newer 
members of the B7 family of proteins, see also Azuma, M., et aL, 
Nature 366:76-79, 1993; and Freeman, G., et aL, Science 262:909-91 1, 
1993) are known and easily cloned and incorporated in PNV's 
according to this invention using methods known to the skilled 
practitioner. Preferably, the genes used for these purposes are the 
human genes so that immune responses against these proteins are 
minimized, allowing the expressed proteins to carry out their 
immunomodulatory and immunostimulatory functions. Where HIV 
genes have been rendered REV -independent, the REV cistron may be 
eliminated completely and a second cistron encoding a B7 gene family 
member and a third cistron encoding yet another gene-product such as 
IL-12, may be constructed. 

The use of tissue-specific promoters or enhancers, for 
example the muscle creatine kinase (MCK) enhancer element, is 
desirable whenever it is desirable to limit expression of the 
polynucleotide to a particular tissue type. For example, myocytes are 
terminally differentiated cells which do not divide. Integration of 
foreign DNA into chromosomes appears to require both cell division 
and protein synthesis. Thus, limiting protein expression to non- 
dividing cells such as myocytes is preferable. However, use of the CMV 
promoter is adequate for achieving expression in many tissues into 
which the PNV is introduced. 

In the various embodiments of this invention which are 
described below, the basic paradigm described above is used. 
Deviations, additions or subtractions from this basic construction design 
serve to hi-light the various aspects of this invention. 
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This patent disclosure exemplifies bi- or tri-cistronic HIV 
polynucleotide immunogens as polynucleotide vaccines, PNVs, to 
generate humoral immunity as well as cross-strain cellular antiviral 
immunity. The system is useful, however, for any two or three 
cistrons, whether or not related to HIV, when co-expression of the 
encoded gene products in a single cell in vivo is required. However, the 
dual humoral and cellular immune responses generated according to this 
invention are particularly significant to inhibiting HIV infection, given 
the propensity of HIV to mutate within the infected population, as well 
as in infected individuals. In order to formulate an effective protective 
vaccine for HIV it is desirable to generate both a multivalent antibody 
response for example to gpl60 (env is approximately 80% conserved 
across various HIV-1, clade B strains, which are the prevalent strains in 
US human populations), the principal neutralization target on HIV, as 
well as cytotoxic T cells reactive to the conserved portions of gpl60 
and, internal viral proteins encoded by gag. We have made an HIV 
vaccine comprising gpl60 genes selected from common laboratory 
strains; from predominant, primary viral isolates found within the 
infected population; from mutated gpl60s designed to unmask cross- 
strain, neutralizing antibody epitopes; from other representative HIV 
genes such as the gag gene (>95% conserved across HIV isolates); and 
from SIV, which provides an animal model for testing the HIV PNV 
wherein non-human primates can be immunized and challenged to test 
viral load and progression to disease. 

Virtually all HFV seropositive patients who have not 
advanced towards an immunodeficient state harbor anti-^a;^^ CTLs while 
about 60% of these patients show cross-strain, gpl60-specific CTLs. 
The amount of HIV specific CTLs found in infected individuals that 
have progressed on to the disease state known as AIDS, however, is 
much lower, demonstrating the significance of our findings that we can 
induce cross-strain CTL responses. Because HIV late gene expression is 
/?£:Vdependent our gpl60 and gag vaccination vectors are designed to 
also produce REV (-90% conserved), to facilitate the REV-dependent 
gene expression. An additional benefit of this invention is that anti- 
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REV immune responses are also generated. This gives further 

advantage to our vaccine because REV is made in large quantities very 

early following infection of a cell, and hours in advance of synthesis of * 

the late gene products, thereby providing an earlier opportunity for 

intervention by vaccine-induced T-cell responses including CTLs and T- 

helper cells. 

In a further embodiment of this invention, a cocktail 
vaccine is prepared in which different HIV REV-dependent gene 
constructs are mixed together to generate anti-REV CTL responses in 
addition to antibodies and CTL against the immunogenic HIV REV- 
dependent gene products. According to this embodiment, one 
polynucleotide encoding gpl60, followed by REV, followed by B7, in a 
tri-cistronic construct having one promoter and two IRES sequences, is 
mixed with another polynucleotide encoding a gag gene product, REV, 
and B7 or another immunomodulatory or immunostimulatory gene 
product such as IL-12 or GM-CSF. In this fashion, with a single or 
several injections of polynucleotide, immune responses against several 
HIV related immunogens can be raised. Likewise, one polynucleotide 
comprising a REV independent gene product, such as those described in 
WO 93/20212, B7, and another immunomodulatory or 
immunostimulatory gene, such as IL-12 or GM-CSF, are mixed with 
another REV-dependent, or REV -independent bi- or tri-cistronic 
expression construct. Furthermore, multiple bi- or tri-cistronic 
constructs encoding HIV or other antigens could be prepared and mixed 
to produce a multivalent combination polynucleotide vaccine. 

Immune responses induced by our env , REV, and gag 
polynucleotide vaccine constmcts are demonstrated in mice, rabbits, and 
primates. Monitoring antibody production to env in mice allows 
confirmation that a given construct is suitably immunogenic, i.e., a high 
proportion of vaccinated animals show an antibody response. Mice also 
provide the most facile animal model suitable for testing CTL induction > 
by our constructs and are therefore used to evaluate whether a 
particular construct is able to generate such activity. However, mouse 
cell lines have been observed to not support efficient REV or tat 
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functions. This observation was made in the context of HIV LTR 
driven expression of late genes and a Hmited amount of data indicates 
that heterologous promoters allow REV function in mouse cells. 
Rabbits and monkeys (African Green, rhesus, chimpanzees) provide 
additional species including primates for antibody evaluation in larger, 
non-rodent animals. These species are also preferred to mice for 
antisera neutralization assays due to high levels of endogenous 
neutralizing activities against retrovimses observed in mouse sera. 
These data demonstrate that sufficient immunogenicity is engendered by 
our vaccines to achieve protection in experiments in a 
chimp anzee/HIVinB challenge model. The currently emerging and 
increasingly accepted definition of protection in the scientific 
community is moving away from so-called "sterilizing immunity", 
which indicates complete protection from HIV infection, to prevention 
of disease. A number of correlates of this goal include reduced blood 
viral titer, as measured either by HIV reverse transcriptase activity, by 
infectivity of samples of semm, by ELISA assay of p24 or other HIV 
antigen concentration in blood, increased CD4"*" T-cell concentration, 
and by extended survival rates [see, for example, Cohen, J., Science 
262 :1820-1821, 1993, for a discussion of the evolving definition of anti- 
HIV vaccine efficacy]. The immunogens of the instant invention also 
generate neutralizing immune responses against infectious (clinical, 
primary field) isolates of HIV. 

Immunology 

A, Antibody Responses to env. 

1. gpl60 and gpl20 . An ELISA assay is used to determine 
whether vaccine vectors expressing either secreted gpl20 or membrane- 
bound gpl60 are efficacious for production of env-specific antibodies. 
Initial in vitro characterization of env expression by our vaccination 
vectors is provided by immunoblot analysis of gpl60 transfected cell 
lysates. These data confirm and quantitate gpl60 expression using anti- 
gp41 and anti-gpl20 monoclonal antibodies to visualize transfectant cell 
gpl60 expression. In one embodiment of this invention, gpl60 is 



wo 95/24485 



PCT/US95/02633 



- 26 - 



preferred to gpl20 for the following reasons: (1) an initial gpl20 
vector gave inconsistent immunogenicity in mice and was very poorly 
or hon-responsive in African Green Monkeys; (2) gpl60 contributes 
additional neutralizing antibody as well as CTL epitopes by providing 
the addition of approximately 1 90 amino acid residues due to the 
inclusion of gp41; (3) gpl60 expression is more similar to viral env 
with respect to tetramer assembly and overall conformation; and (4) we 
find that, like the success of membrane-bound, influenza HA constructs 
for producing neutralizing antibody responses in mice, ferrets, and 
nonhuman primates [see Ulmer et al.. Science 259:1745-1749, 1993; 
Montgomery, D., et al., DNA and Cell Biol. 12:777-7R3. 1993] anti- 
gpl60 antibody generation is superior to anti-gpl20 antibody 
generation. Selection of which type of env , or whether a cocktail of 
env subfragments, is preferred is determined by the experiments 
outlined below. 

2. Presence a nd Breadth of Neutralizing Activitv . ELISA 
positive antisera from rabbits and monkeys is tested and shown to 
neutralize both homologous and heterologous HIV strains. 

3. V3 vs. no n-V3 NeutraHzing Antibodies . A major goal 
for env PNVs is to generate broadly neutralizing antibodies. It has now 
been shovm that antibodies directed against V3 loops are very strain 
specific, and the serology of this response has been used to define 
strains. 

a. Non-V3 neutralizing antibodies appear to primarily 
recognize discontinuous, structural epitopes within gpl20 which are 
responsible for CD4 binding. Antibodies to this domain are polyclonal 
and more broadly cross-neutralizing probably due to restraints on 
mutations imposed by the need for the virus to bind its cellular ligand. 
An in vitro assay is used to test for blocking gpl20 binding to CD4 
immobilized on 96 well plates by sera from immunized animals. A 
second in vitro assay detects direct antibody binding to synthetic 
peptides representing selected V3 domains immobilized on plastic. 
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These assays are compatible for antisera from any of the animal types 
used in our studies and define the types of neutralizing antibodies our 
vaccines have generated as well as provide an in vitro correlate to virus 
neutralization. 

b. gp41 harbors at least one major neutralization 
determinant, corresponding to the highly conserved linear epitope 
recognized by the broadly neutralizing 2F5 monoclonal antibody 
(commercially available from Viral Testing Systems Corp., Texas 
Commerce Tower, 600 Travis Street, Suite 4750, Houston, TX 77002- 
3005(USA), or Waldheim Pharmazeutika GmbH, Boltzmangasse 1 1, A- 
1091 Wien, Austria), as well as other potential sites including the well- 
conserved "fusion peptide" domain located at the N-terminus of gp41. 
Besides the detection of antibodies directed against gp41 by immunoblot 
as described above, an in vitro assay test is used for antibodies which 
bind to synthetic peptides representing these domains immobilized on 
plastic. 

4. Maturation of the Antibodv Response . In HIV 
seropositive patients, the neutralizing antibody responses progress from 
chiefly anti-V3 to include more broadly neutralizing antibodies 
comprising the stmctural gpl20 domain epitopes described above (#3), 
including gp41 epitopes. These types of antibody responses are 
monitored over the course of both time and subsequent vaccinations. 

B. T Cell Reactivities Against env . REV, ngf and ga^. 

1 . Generation of CTL Responses . Viral proteins which are 
synthesized within cells give rise to MHC I-restricted CTL responses. 
Each of these proteins ehcit CTL in seropositive patients. Our vaccines 
also are able to ehcit CTL in mice. The immunogenetics of mouse 
strains are conducive to such studies, as demonstrated with influenza 
NP, [see Ulmer et al.. Science 259:1745-1749, 1993]. Several epitopes 
have been defined for the HIV proteins env, REV, nef and gag in 
Balb/c mice, thus facilitating in vitro CTL culture and cytotoxicity 
assays. Additionally, it is advantageous to use syngenic tumor lines. 
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such as the murine mastocytoma P815, transfected with these genes to 
provide targets for CTL as well as for in vitro antigen specific 
restimulation. Methods for defining immunogens capable of eliciting 
MHC class I-restricted cytotoxic T lymphocytes are known [see Calin- 
Laurens, et al., Vaccine ll(9):974-978, 1993; see particularly Eriksson, 
et al., Vaccine ll(8):859-865, 1993, wherein T-cell activating epitopes ' 
on the HIV gpl20 were mapped in primates and several regions, 
including gpl20 amino acids 142-192, 296-343, 367-400, and 410-453 
were each found to induce lymphoproliferation; furthermore, discrete 
regions 248-269 and 270-295 were lymphoprohferative. A peptide 
encompassing amino acids 152-176 was also found to induce HIV 
neutralizing antibodies], and these methods may be used to identify 
immunogenic epitopes for inclusion in the PNV of this invention. 
Alternatively, the entire gene encoding gpl60, gpl20, protease, or gag 
could be used. For additional review on this subject, see for example, 
Shirai et al., JJnimunol 148:1657-1667, 1992; Choppin et al., L 
Immunol 142:569-574, 1991; Choppin et al.. J. Immunol 147 :575-583, 
1991; Berzofskyetal.,JjClinJiiyest. 88:876-884, 1991. As used 
herein, T-cell effector function is associated with mature T-cell 
phenotype, for example, cytotoxicity, cytokine secretion for B-cell 
activation, and/or recruitment or stimulation of macrophages and 
neutrophils. 

2- Measurement nfTn ArtiviH^c Spleen cell cultures 
derived from vaccinated animals are tested for recall to specific antigens 
by addition of either recombinant protein or peptide epitopes. 
Activation of T cells by such antigens, presented by accompanying 
splenic antigen presenting cells, APCs, is monitored by proliferation of 
these cultures or by cytokine production. The pattern of cytokine 
production also allows classification of Th response as type 1 or type 2. 
Because dominant Th2 responses appear to correlate with the exclusion 
of cellular immunity in immunocompromised seropositive patients, it is 
possible to define the type of response engendered by a given PNV in 
patients, permitting manipulation of the resulting immune responses. 
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3. Delayed Type Hypersensitivity rOTHV DTH to yiral 
antijgen after i.d. injection is indicative of cellular, primarily MHC 11- 
restricted, immunity. Because of the commercial availability of 
recombinant HIV proteins and synthetic peptides for known epitopes, 
DTH responses are easily determined in vaccinated vertebrates using 
these reagents, thus providing an additional in vivo correlate for 
inducing cellular immunity. 

Protection 

Based upon the above immunologic studies, it is predictable 
that our vaccines are effective in vertebrates against challenge by 
virulent HIV. These studies are accomplished in an 
HIVlllB/chimpanzee challenge model after sufficient vaccination of 
these animals with a PNV construct, or a cocktail of PNV constructs 
comprised of gpl60lIIB, ^«^IIIB, neflllB and REVuiB- The lEB 
strain is useful in this regard as the chimpanzee titer of lethal doses of 
this strain has been established. However, the same studies are 
envisioned using any strain of HIV and the epitopes specific to or 
heterologous to the given strain. A second vaccination/challenge model, 
in addition to chimpanzees, is the scid-hu PBL mouse. This model 
allows testing of the human lymphocyte immune system and our vaccine 
with subsequent HIV challenge in a mouse host. This system is 
advantageous as it is easily adapted to use with any HIV strain and it 
provides evidence of protection against multiple strains of primary field 
isolates of HIV. A third challenge model utilizes hybrid HIV/SIV 
viruses (SHIV), some of which have been shown to infect rhesus 
monkeys and lead to immunodeficiency disease resulting in death [see 
Li, J., et al., J. AIDS 5:639-646, 1992]. Vaccination of rhesus with our 
polynucleotide vaccine constructs is protective against subsequent 
challenge with lethal doses of SHIV. 



PNV Construct Summary 
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HIV and other genes are preferably ligated into an 
expression vector which has been specifically optimized for 
polynucleotide vaccinations. According to this invention disclosure 
methods for producing several such vectors are enabled. Essential i all 
extraneous DNA is removed, leaving the essential elements of 
transcriptional promoter, immunogenic epitopes, and additional cis— ns 
encoding immunoenhancing or iimnunomodulatory genes, with the: 
own promoters or IRES, transcriptional terminator, bacterial orign r 
replication and antibiotic resistance gene, as previously described ( sr 
figure 2). Those skilled in the art will appreciate that introduction : 
RNA which has been transcribed in vitro to produce the multi-cistro- j 
mRNAs encoded by the DNA counterparts of this invention natural 
forms an integral part of this invention. For this purpose, it is desiroie 
to use as the transcriptional promoter such powerful RNA polymera= 
promoters as the T7 or SP6 promoters, and performing run-on 
transcription with a linearized DNA template. These methods are v.'-:i 
known in the art. 

Expression of HIV late genes such as env and gag is RI - 
dependent and requires that the REV response element (RRE) be pr^ent 
on the viral gene transcript. A secreted form of gpl20 can be generced 
in the absence of REV by substitution of the gpl20 leader peptide \\:n 
a heterologous leader such as from tPA (tissue-type plasminogen 
activator), and preferably by a leader peptide such as is found in hipy 
expressed mammalian proteins such as immunoglobulin leader pepties. 
We have inserted a tPA-gpl20 chimeric gene into VlJns which 
efficiently expresses secreted gpl20 in transfected cells (RD, a hum:, 
rhabdomyosarcoma line). We have also developed an IRES-based 
(IRES = internal ribosomal entry site) dicistronic VlJns vector 
containing both gpl60 (which harbors the RRE) and REV which 
efficiently expresses gpl60 in transfected cell lines (293, a human 
embryonic kidney cell line; and RD). Monocistronic gpl60 does n: 
produce any protein upon transfection without the addition of a RE 
expression vector. Dicistronic gpl60/^£K produces similar amour 
of gpI60 as co-transfected gpl60 and REV monocistronic vectors. 
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From these studies, it is predictable that dicistronic vectors more 
efficiently express gpl60 following introduction in vivo intramuscularly 
relative to a mixture of gpl60 and REV vectors because the dicistron 
^ insures the proximity of gpl60 construct and REV within stmcturally 
extended, multi-nucleated muscle cells. This dicistronic strategy also 
supports expression of gag after the inclusion of the RRE within the 
transcript region of the vector. It also supports the expression of 
unrelated genes in a bi- or tri-cistronic PNV, such as co-expression of 
HIV immunogenic epitopes, influenza vims immunogenic epitopes, 
cancer- related antigens, and immunomodulatory genes such as 
interleukin, B7 and GM-CSF. 

Representative Constmct Components hiclude (but are not restricted to) 
(see figure 2. Cistrons L and III): 

1. tPA-gpl20MN; 

2. gpl60lllB/IRES//?£KlllB; 

3. gpl60lllB; 

4. /?£KIIIB; 

5. tat/REV/gpl60 (a genomic IIIB clone which weakly 

expresses gpl60); 

6. REV/gpl60; 

7. gpl60MN; 

8. gpl60 from clinically relevant primary HIV isolates; 
2^ 9, nef, using the gene from clinically relevant strains; 

10. gagUlB: for anti-j?ag CTL; 

11. tPA-gpl20llIB: for chimp studies; 

12. gpl60 with structural mutations: V3 loop substitutions 
from clinically relevant strains of HIV; several mutations 
on several constmcts such as variable loop removal, Asn 
mutations to remove steric carbohydrate obstacles to 

/ stmctural, neutralizing antibody epitopes; and CD4 

binding site knockout mutants; 

13. gp41: to specifically elicit anti-gp41 neutralizing 
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antibodies, particularly the 2F5 monoclonal antibody 
epitope, located directly anterior to the transmembrane 
domain, which is broadly conserved across many strains. 
This peptide is difficult to express in the absence of gpl20 
and requires several strategies, e.g., a recent report found 
that the 2F5 epitope spliced into an influenza HA loop tip 
could elicit HIV neutralizing antibodies; alternatively, 
provision of appropriate leader sequences, as in the tPA 
signal peptide leader sequence, allows expression of this 
gene product; 

14. ^ag: similar to construct from #5 above, using the 
gene from clinically relevant strains; 

15. /-<?v: for gpl60 and ^'fl.i,' dicistronics; 

16. B7 coding sequences; 

17. GM-CSF sequences; 

18. Interleukin sequences, particularly encoding IL-12; 

19. Tumor associated antigens; 

20. Genes encoding antigens expressed by pathogens other 
than HIV, such as, but not limited to, influenza vims 
nucleoprotein, hemagglutinin, matrix, neuraminidase, and 
other antigenic proteins; herpes simplex virus genes; human 
papillomavirus genes; tuberculosis antigens; hepatitis A, B, 
or C virus antigens; and combinations of these and other 
antigens to form at least dicistronic constructs which may 
be combined with multiple other polycistronic constructs to 
provide a cocktail composition capable of raising immune 
responses against all of the represented pathogens or 
tumor antigens. 

In the HIV env constructs, those of ordinary skill in the art 
will recognize the desirability of expressing nucleic acids encoding 
various env V3 loop amino acid sequences. As an example, any or all 
of the following amino acid sequences, or portions thereof, may be 
encoded by HIV polynucleotide immunogens of this invention: 
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GP160 V3 LOOP SEQUENCE SUMMARY FOR PNV 
CONSTRUCTS 

^ North American/European Consensus, SEQ.ID:!: 

CysThrArgProAsnAsnAsnThrArgLysSerlleHisIleGlyProGlyArgAla 
PheTyrThrThrGlyGluIIelleGlyAspIleArgGlnAIaHisCys 

MN, SE0.ID:2: 

CysThrArgProAsnTyrAsnLysArgLysArglleHisIleGlyProGlyArgAla 
PheTyrThrThrLysAsnllelleGlyThrlleArgGlnAlaHisCys 

IIIB (HXB2R), SEQ.ID:3: 

CysThrArgProAsnAsnAsnThrArgLysArglleArglleGlnArgGlyProGly 
ArgAIaPheValThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCys 

116-v, SEQ.ID:4: 

CysThrArgProAsnAsnAsnThrArgLysGIylleHisIleGIyProGlyArgAla 
PheTyrThrThrGlyLysIlelleGlyAsnlleArgGinAlaHisCys 

^° 452-p, SEQ.ID:5: 

CysThrArgProSerAsnAsnAsnThrArgLysSerlleHisIleGlyProGlyLys 
AlaPheTyrAlaThrGlyAlallelleGlyAspIleArgGlnAlaHisCys 

146-v, SEQ.ID:6: 

25 

CysThrArgProAsnAsnAsnThrArgArgSerlleHisIleAIaProGlyArgAla 
PheTyrAlaThrGlyAspIlelleGlyAspIleArgGlnAlaHisCys 

The protective efficacy of polynucleotide HIV immunogens 
jQ against subsequent viral challenge is demonstrated by immunization with 
the non-replicating plasmid DNA of this invention. This is 
advantageous since no infectious agent is involved, no assembly of virus 
particles is required, and determinant selection is permitted. 
Furthermore, because the sequence of gag and protease and several of 
the other viral gene products is conserved among various strains of 
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HIV, protection against subsequent challenge by a virulent strain of HIV 

that is homologous to, as well as strains heterologous to the strain from 

which the cloned gene is obtained, is enabled. ' 

The i.m. injection of a DNA expression vector encoding 
gpl60 results in the generation of significant protective immunity 
against subsequent viral challenge. In particular, gp 1 60-specific 
antibodies and primary CTLs are produced. Immune responses directed 
against conserved proteins can be effective despite the antigenic shift 
and drift of the variable envelope proteins. Because each of the HIV 
gene products exhibit some degree of conservation, and because CTLs 
are generated in response to intracellular expression and MHC 
processing, it is predictable that many virus genes give rise to responses 
analogous to that achieved for gpl60. Thus, many of these genes have 
been cloned, as shown by the cloned and sequenced junctions in the 
expression vector (see below) such that these constructs are 
immunogenic agents in available form. 

The invention offers a means to induce cross-strain 
protective immunity without the need for self-replicating agents or 
adjuvants. In addition, immunization with the instant polynucleotides 
offers a number of other advantages. First, this approach to vaccination 
should be applicable to tumors as well as infectious agents, since the 
CD8+ CTL response is important for both pathophysiological processes 
[K. Tanaka et al, Annu. Rev. Immunol. 6, 359 (1988)]. Therefore, 
eliciting an immune response against a protein crucial to the 
transformation process may be an effective means of cancer protection 
or immunotherapy. Second, the generation of high titer antibodies 
against expressed proteins after injection of viral protein and human 
growth hormone DNA, [see for example D.-c. Tang et al., Nature 356, 
152, 1992], indicates this is a facile and highly effective means of 
making antibody-based vaccines, either separately or in combination 
with cytotoxic T-lymphocyte vaccines targeted towards conserved 
antigens. 

The ease of producing and purifying DNA constructs 
compares favorably with traditional protein purification, facilitating the 
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generation of combination vaccines. Thus, multiple constructs, for 
example encoding gpl60, gpl20, gp41, or any other HIV gene may be 
prepared, mixed and co -administered. Finally, because protein 
expression is maintained following DNA injection [H. Lin et ai. 
Circulation 82, 2217 (1990); R.N. Kitsis et aL, Proc. Natl. Acad. Sci. 
(USA) 88, 4138 (1991); E. Hansen et al, FEBS Lett. 290, 73 (1991); 
S. Jiao et aL. Hum. Gene Therapy 3, 21 (1992); J.A. Wolff et al, 
Human Mol. Genet. 1, 363 (1992)], the persistence of B- and T-cell 
memory may be enhanced [D. Gray and P. Matzinger, J. Exp. Med. 
174, 969 (1991); S. Oehen et a/., ibid. 176, 1273 (1992)], thereby 
engendering long-lived humoral and cell-mediated immunity. 

The standard techniques of molecular biology for 
preparing and purifying DNA constmcts enable the preparation of the 
DNA immunogens of this invention. While standard techniques of 
molecular biology are therefore sufficient for the production of the 
products of this invention, the specific constructs disclosed herein 
provide polynucleotide immunogens which surprisingly produce cross- 
strain and primary HIV isolate neutralization, a result heretofore 
unattainable with standard inactivated whole vims or subunit protein 
vaccines. 

The amount of expressible DNA or transcribed RNA to be 
introduced into a vaccine recipient will depend on the strength of the 
transcriptional and translational promoters used and on the 
immunogenicity of the expressed gene product. In general, an 
immunologically or prophylactically effective dose of about 1 ng to 100 
mg, and preferably about 10 |ag to 300 |Lig is administered directly into 
muscle tissue. Subcutaneous injection, intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also 
contemplated. It is also contemplated that booster vaccinations are to be 
provided. Following vaccination with HIV polynucleotide immunogen, 
boosting with HIV protein immunogens such as gpl60, gpl20, and gag 
gene products is also contemplated. Parenteral administration, such as 
intravenous, intramuscular, subcutaneous or other means of 
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administration of inteiieukin-1 2 protein, concurrently with or 
subsequent to parenteral introduction of the PNV of this invention a; 
advantageous. 

The polynucleotide may be naked, that is, unassociated v:th 
any proteins, adjuvants or other agents which impact on the recipien:^ 
immune system. In this case, it is desirable for the polynucleotide tc 
in a physiologically acceptable solution, such as, but not limited to, 
sterile saline or sterile buffered saline. Alternatively, the DNA max re 
associated with liposomes, such as lecithin liposomes or other liposones 
known in the art, as a DNA-liposome mixture, or the DNA may be 
associated with an adjuvant known in the art to boost immune responses, 
such as a protein or other carrier. Agents which assist in the celluk* 
uptake of DNA, such as, but not limited to, calcium ions, may also bt 
used to advantage. These agents are generally referred to herein as 
transfection faciHtating reagents and pharmaceutically acceptable 
carriers. Techniques for coating microprojectiles coated with 
polynucleotide are known in the art and are also useful in connectior 
with this invention. 

Accordingly, one embodiment of this invention is a 
polynucleotide which, upon introduction into mammaUan tissue, indjces 
the co-expression in a single cell, in vivo , of two or three different, 
discrete gene products, comprising: 

a first transcriptional promoter which operates efficiently in eukar}':ric 
cells upstream from and in transcriptional control of a first cistron: 
a second cistron downstream from the first cistron, under 
transcriptional control either of the first transcriptional promoter, c - 
under control of a second transcriptional promoter; 
optionally, a third cistron downstream from the second cistron, undt- 
transcriptional control either of the first transcriptional promoter, 
under control of a second transcriptional promoter, or under contrc . jf 
a third transcriptional promoter; 

a transcriptional terminator following each of the first, second and ziird 
cistron, unless followed by another citron lacking its own 
transcriptional promoter. 
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In another embodiment, the invention is a polynucleotide 
which comprises contiguous nucleic acid sequences which cannot 
repHcate in eukaryotic cells but which are capable of being expressed to 
produce a gene product upon introduction of the polynucleotide into 
eukaryotic tissues in vivo . The encoded gene product preferably either 
acts as an immuno stimulant or as an antigen capable of generating an 
immune response. Thus, the nucleic acid sequences in this embodiment 
encode a spliced REV gene, a human immunodeficiency virus (HIV) 
immunogenic epitope, and optionally, a cytokine or a T-cell 
costimulatory element, such as a member of the B7 family of proteins. 

In another embodiment, the invention is a method for co- 
expression in a single cell, in vivo, of two or three different, discrete 
gene products, which comprises introducing between about 0.1 |ig and 
100 mg of a polynucleotide of this invention into the tissue of the 
vertebrate. 

In another embodiment, the invention is a method for using 
a REV dependent HIV gene to induce immune responses in vivo which 
comprises: 

a) isolating the REV dependent HIV gene; 

b) linking the isolated gene to regulatory sequences such 
that the gene is expressible by virtue of being operatively linked to 
control sequences which, when introduced into a living tissue, direct the 
transcription initiation and subsequent translation of the gene; 

c) introducing the expressible gene into a living tissue; 

d) introducing a gene encoding HIV REV either in trans or 
in cis to the HIV REV dependent gene; and 

e) optionally, boosting with additional expressible HIV 

gene. 

A further embodiment of this invention amounts to a 
method of inducing an antigen presenting cell to stimulate cytotoxic T- 
cell proliferation specific to HIV antigens. This involves exposing cells 
of a vertebrate in vivo to a polynucleotide which consists of an antigenic 
HIV epitope, REV if the antigenic HIV epitope depends on REV for 
efficient expression, and B7 encoding sequences. 
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The following examples are provided to further define the 
invention, without limiting the invention to the specifics of the 
examples. 

Materials descriptions 

Vectors pF41 1 and pF412: These vectors were subcloned 
from vector pSP62 which was constructed in R. Gallo's lab. pSP62 is 
an available reagent from Biotech Research Laboratories, Inc. pSP62 
has a 12.5 kb Xbal fragment of the HXB2 genome subcloned from 
lambda HXB2. Sail and Xba I digestion of pSP62 yields to HXB2 
fragments: 5'-XbaI/SalI, 6.5 kb and 3'- Sall/Xbal, 6 kb. These inserts 
were subcloned into pUC 18 at Smal and Sail sites yielding pF41 1 (5'- 
Xbal/Sall) and pF412 (3'-XbaI/SalI). pF41 1 contains gag/pol and 
pF4 1 2 contains tat/rev/env/nef . 

Repligen reagents: 
recombinant rev (IIIB), #RP1024-10 
rec. gpl20 (HIB), #RP 1001-10 
anti-rev monoclonal antibody, #RP1029-10 
anti-gpl20 mAB, #1C1, #RP1010-10 

AIDS Research and Reference Reagent Program: 
anti-gp41 mAB hybridoma, Chessie 8, #526 

EXAMPLE 1 
VECTORS FOR VACCINE PRODIJCTTON 

A) VI : The expression vector VI was constructed from pCMVIE-AKI- 
DHFR [Y. Whang et al.. J. Virol. 61, 1796 (1987)]. The AKI and 
DHFR genes were removed by cutting the vector with EcoR I and self- 
ligating. This vector does not contain intron A in the CMV promoter, 
so it was added as a PCR fragment that had a deleted intemal Sac I site 
[at 1855 as numbered in B.S. Chapman et al., Nuc. Acids Res. 19, 3979 
(1991)]. The template used for the PCR reactions was pCMVintA-Lux, 
made by ligating the Hind III and Nhe I fragment from pCMV6al20 
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[see B.S. Chapman et aL, ihid.,] which includes hCMV-IEl 
enhancer/promoter and intron A, into the Hind III and Xba I sites of 
pBL3 to generate pCMVIntBL. The 1881 base pair luciferase gene 
fragment (Hind Ill-Sma I Klenow filled-in) from RSV-Lux [J.R. de Wet 
^ et aL, MoL Cell Biol. 7, 725, 1987] was cloned into the Sal I site of 
pCMVIntBL, which was Klenow filled-in and phosphatase treated. 

The primers that spanned intron A are: 
5' primer, SEQ. ID:7: 

5'-CTATATAAGCAGAG CTCGTTTAG-3 The 3' primer, SEQ ID:8: 
5'-GTAGCAAAGATCTAAGGACGGTGA CTGCAG-3'. 

The primers used to remove the Sac I site are: 
sense primer, SEQ ID:9: 

5-GTATGTGTCTGAAAATGAGCGTGGAGATTGGGCTCGCAC-3' 
and the antisense primer, SEQ ID: 10:, 

5 -GTGCGAGCCCAATCTCCACGCTCATTTTCAGACACA TAC-3\ 
The PGR fragment was cut with Sac I and Bgl II and 

20 

inserted into the vector which had been cut with the same enzymes. 

B^ VI J EXPRESSION VECTOR, SEP, ID: 12: 

Our purpose in creating VI J was to remove the promoter 
and transcription termination elements from our vector, V 1 , in order to 
place them within a more defined context, create a more compact 
vector, and to improve plasmid purification yields. 

VIJ is derived from vectors VI, (see Example 1) and 
pUC18, a commercially available plasmid. VI was digested with Sspl 
and EcoRI restriction enzymes producing two fragments of DNA. The 

30 

smaller of these fragments, containing the CM Vint A promoter and 
Bovine Growth Hormone (BGH) transcription termination elements 
which control the expression of heterologous genes (SEQ ID: 13:), was 
purified from an agarose electrophoresis gel. The ends of this DNA 
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fragment were then "blunted" using the T4 DNA polymerase enzyme in 
order to facilitate its ligation to another "blunt-ended" DNA fragment. 

pUC18 was chosen to provide the "backbone" of the 
expression vector. It is known to produce high yields of plasmid, is 
well-characterized by sequence and function, and is of minimum size. 
We removed the entire lac operon from this vector, which was 
unnecessary for our purposes and may be detrimental to plasmid yields 
and heterologous gene expression, by partial digestion with the Haell 
restriction enzyme. The remaining plasmid was purified from an 
agarose electrophoresis gel, blunt-ended with the T4 DNA polymerase , 
treated with calf intestinal alkaline phosphatase, and ligated to the 
CMVintA/BGH element described above. Plasmids exhibiting either of 
two possible orientations of the promoter elements within the pUC 
backbone were obtained. One of these plasmids gave much higher 
yields of DNA in E. coli and was designated VIJ (SEQ. ID: 12:). This 
vector's structure was verified by sequence analysis of the junction 
regions and was subsequently demonstrated to give comparable or 
higher expression of heterologous genes compared with VI. 

O VlJneo EXPRESSION VECTOR. SEP. 10:14 ! 

It was necessary to remove the amp^ gene used for 
antibiotic selection of bacteria harboring VIJ because ampicillin may 
not be used in large-scale fermenters. The ampf gene from the pUC 
backbone of VIJ was removed by digestion with Sspl and Eaml 1051 
restriction enzymes. The remaining plasmid was purified by agarose 
gel electrophoresis, blunt-ended with T4 DNA polymerase, and then 
treated with calf intestinal alkaline phosphatase. The commercially 
available kani* gene, derived from transposon 903 and contained within 
the pUC4K plasmid, was excised using the PstI restriction enzyme, 
purified by agarose gel electrophoresis, and blunt-ended with T4 DNA 
polymerase. This fragment was ligated with the VIJ backbone and 
plasmids with the kani" gene in either orientation were derived which 
were designated as VlJneo #'s 1 and 3. Each of these plasmids was ' 
confirmed by restriction enzyme digestion analysis, DNA sequencing of 
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the junction regions, and was shown to produce similar quantities of 
plasmid as VI J. Expression of heterologous gene products was also 
comparable to VI J for these VlJneo vectors. We arbitrarily selected 
VlJneo#3, referred to as VlJneo hereafter (SEQ. ID: 14:), which 
contains the kan^* gene in the same orientation as the amp^* gene in VI J 
as the expression construct, 

D) VlJns EXPRESSION VECTOR: 

An Sfi I site was added to VlJneo to facilitate integration 
studies. A commercially available 13 base pair Sfi I linker (New 
England BioLabs) was added at the Kpn I site within the BGH sequence 
of the vector. VlJneo was linearized with Kpn I, gel purified, blunted 
by T4 DNA polymerase, and Hgated to the blunt Sfi I linker. Clonal 
isolates were chosen by restriction mapping and verified by sequencing 
through the linker. The new vector was designated VlJns. Expression 
of heterologous genes in VlJns (with Sfi I) was comparable to 
expression of the same genes in VlJneo (with Kpn I). 

E) pGEM-3-IRES : The encephalomyocarditis virus (EMCV) intemal 
ribosomal entry site (IRES) allows efficient expression of two genes 
within a single mRNA transcript when it is juxtaposed between them. 
We have utilized this non-coding gene segment to create dicistronic 
expression vectors for polynucleotide vaccines. The EMCV IRES 
segment was subcloned as a 0.6 kb EcoRl/BssHII digestion fragment 
from the pCITE-1 plasmid (Novagen). This fragment was agarose gel- 
purified, blunt-ended using T4 DNA polymerase and subsequently 
ligated into pGEM-3 (Promega) which had been Xbal-digested, blunt- 
ended with T4 DNA polymerase, and phosphatased. Clones were 
obtained for each of the two possible orientations of this DNA within 
pGEM-3 and each junction site verified by DNA sequencing. The 
preferred orientation for subsequent construction of dicistronic vectors 
positioned the Ncol site within the IRES proximal to BamHI site within 
pGEM-3. This vector is referred to as pGEM-3-IRES. 
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F) pGEM-3-IRES* : A second IRES vector was prepared containing 
mutations in the IRES sequence (IRES*) conferred by a PGR oligomer 
which may optimize IRES-driven expression compared to wild type 
IRES. PGR amplification of IRES* was performed using pGITE-1 
plasmid (Novagen) with the following sense and antisense oHgomers: 
5 -GGT AGA AGA TGT AGT ATA GGG AGA GGG GAA TTG GGG- 
3', SEQ. ID:1 1:, and 5'-GGA GAT AGA TGT GTT GCA TGG TTG 
TGG GAA TAT TAT GAT GG-3\ SEQ. ID:15:, respectively. The 
mutated residue, underlined in the antisense codon, eliminates an 
upstream ATG from the preferred ATG contained within the 
Ncol/Kozak sequence at the 3'-terminal end of the IRES 

G) pGEM-3>IRES/7?£K : HlVnib REV was PGR amplified from 
pGV-1 (catalogue #303, NIH AIDS Research and Reference Program) 
using synthetic oligomers. The sense and antisense oligomers were 5'- 
GGT AGA AGA TGT ACCATGGGA GG A AGA AGG GG A GAG 
AGG-3', SEQ. ID: 16:, and 5'-GGA GAT AGA TGT GAT ATG GGA 
GTA TTG TTT AGG TGG TGA GTG G-3\ SEQ. ID: 17:, respectively. 
These oligomers provide Bglll sites at either end of the translation open 
reading frame as well as an EcoRV site directly upstream from the 
Bglll site at the 3'-terminal end of rcw After PGR, the REV gene was 
treated with Ncol (located within the Kozak sequence) and Bglll 
restriction enzymes and hgated with pGEM-3-IRES which had been 
treated with Ncol and BamHI restriction enzymes. Each ligation 
junction as well as the entire 0.3 kb REV gene was confirmed by DNA 
sequencing. 

H) VlJns-tPA : In order to provide an heterologous leader peptide 
sequence to secreted and/or membrane proteins, VlJn was modified to 
include the human tissue-specific plasminogen activator (tPA) leader. 
Two synthetic complementary oligomers were annealed and then ligated 
into VlJn which had been Bglll digested. The sense and antisense 
oligomers were 5'-GATG AGG ATG G AT GGA ATG AAG AGA GGG 
GTG TGG TGT GTG GTG GTG GTG TGT GGA GGA GTG TTG GTT 
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TCG CCC AGC GA-3', SEQ.IDrlS:, and 5'-GAT CTC GCT GGG ."JA 
AAC GAA GAC TGC TCC ACA CAG CAG CAG CAC ACA GC - 
GAG CCC TCT CTT CAT TGC ATC CAT GGT-3', SEQ. ID: 19;. Tlie 
Kozak sequence is underlined in the sense oligomer. These oligome-:^ 
have overhanging bases compatible for ligation to Bglll-cleaved 
sequences. After Hgation the upstream Bgill site is destroyed while ne 
downstream Bglll is retained for subsequent ligations. Both the 
junction sites as well as the entire tPA leader sequence were verifiec :iy 
DNA sequencing. Additionally, in order to conform with our conseisus 
optimized vector VlJns (=VlJneo with an Sfil site), an Sfil restriciun 
site was placed at the Kpnl site within the BGH terminator region o' 
VlJn-tPA by blunting the Kpnl site with T4 DNA polymerase foUoved 
by ligation with an Sfil linker (catalogue #1 138, New England Bioia^s). 
This modification was verified by restriction digestion and agarose isl 
electrophoresis. 

I) VlJns-HIVTnb/?gV : REV was amplified by PCR as describee 
above for pGEM-3-IRES/7?£:K, digested with Bglll restriction enzyne, 
and ligated into VlJns which had been Bglll- and calf intestinal alkaine 
phosphatase-treated. Ligation junctions were confirmed by DNA 
sequencing and expression of REV was verified by in vitro transfecion 
of RD cells and immunoblot analysis (greater than 1 |Lig REV obtainid 
per 106 cells). 

J) pGEM-3-RRE/IRES/REV : In order to make a cassette consisting of 
the REV response element (RRE) which is required to be on an RJ\ - 
transcript in order for REV-dependent expression to occur, the RRZ 
from HIV strain HXB2 was obtained by PCR using the following 
synthetic oligomers: sense oligomer, 5'-GGT ACA TGA TCA GA" 
ATC GCCC GGG C CGA GAT CTT CAG ACT TGG AGG AGG ^C- 
3', SEQ.ID:20:; and antisense oligomer, 5'-CCA CAT TGA TCA C 
CTT GTG TAA TTG TTA ATT TCT CTG TCC-3', SEQ.ID:21:. 
These oligomers provide Bell restriction sites at either end of the iii'ert 
as well as EcoRV and Srfl sites at the 5 '-end of the insert. The RRI 
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was blunt-end ligated into pGEM-3-/IRES/REV at the Hindi restriction 
site which precedes IRES. The ligation products were verified by 
restriction enzyme mapping and by DNA sequencing across the Hgation 
^ junctions. 

EXAMPLE 2 

gpl20 Vaccines: 

Expression of the REV -dependent env gene as gpl20 was 
conducted as follows: gpl20 was PCR-cloned from the MN strain of 

10 

HIV with either the native leader peptide sequence (VI Jns-gpl20), or as 
a fusion with the tissue-plasminogen activator (tPA) leader peptide 
replacing the native leader peptide (VlJns-tPA-gpl20). tPA-gpl20 
expression has been shown to be /^^^K-independent [B.S. Chapman et aL, 
Nuc. Acids Res. 19, 3979 (1991); it should be noted that other leader 
sequences would provide a similar function in rendering the gpl20 gene 
REV independent]. This was accomplished by preparing the following 
gpl20 constmcts utilizing the above described vectors: 

L gpl20 VACCINE CONSTRUCTS: 

A^ V 1 Jns-tP A-HIVm n gp 1 20 : HIVmN gp 1 20 gene (Medimmune) 
was PCR amplified using oligomers designed to remove the first 30 
amino acids of the peptide leader sequence and to facilitate cloning into 
VI Jns-tPA creating a chimeric protein consisting of the tPA leader 
peptide followed by the remaining gpl20 sequence following amino acid 
residue 30. This design allows for REV -independent gpl20 expression 
and secretion of soluble gpl20 from cells harboring this plasmid. The 
sense and antisense PCR oligomers used were 5'-CCC CGG ATC CTG 
ATC ACA GAA AAA TTG TGGGTC ACA GTC-3\ SEQ. ID:22:, and 
5'-C CCC AGG AATC CAC CTG TTA GCG CTT TTC TCT CTG 
CAC CAC TCT TCT C-3', SEQ. ID:23:. The translation stop codon is 
underlined. These oligomers contain BamHI restriction enzyme sites at 
either end of the translation open reading frame with a Bell site located 
3' to the BamHI of the sense oligomer. The PCR product was 
sequentially digested with Bell followed by BamHI and ligated into 
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VlJns-tPA which had been Bglll digested followed by calf intestinal 
alkahne phosphatase treatment. The resulting vector was sequenced to 
confirm inframe fusion between the tPA leader and gpl20 coding 
sequence, and gpl20 expression and secretion was verified by 
immunoblot analysis of transfected RB cells. Thus, this vector encoding 
the tPA-HIVMN-gpl20 is useful for inclusion in a bi- or tri-cistronic 
construct expressing gag, B7 or other antigens. 

B) Vl-tPA~HIVMN gpl20 : A slightly different version of the chimeric 
tPA-HIVMN gpl20 vector described above was made using an earher 
version of our basic vaccine expression vector, VI (see Nucleic Acid 
Pharmaceuticals patent), which contained a somewhat different tPA 
peptide leader sequence from that described for VlJns-tPA. 

In either of the foregoing PNV constructs, provision of an IRES 
sequence after the translation stop codon, and downstream cloning of 
immunomodulatory genes such as B7, provides bi- or tri-cistronic 
.polynucleotides useful according to the method of this invention. These 
PNV's efficiently express both gene products. 

C) VlJns-tPA-HIVT TTR gpl20 : This vector is analogous to LA. except 
that the HIV niB strain was used for gpl20 sequence. The sense and 
antisense PGR ohgomers used were: 5'-GGT ACA TGA TCA CA GAA 
AAA TTG TGG GTC ACA GTC-3 , SEQ.ID:24:, and 5*-CCA CAT 
TGA TCA GAT ATC TTA TCT TTT TTC TCT CTG CAC CAC TCT 
TC-3', SEQ.ID:25:, respectively. These oligomers provide Bell sites at 
either end of the insert as well as an EcoRV just upstream of the Bell 
site at the 3 '-end. The 5 '-terminal Bell site allows ligation into the 
Bgin site of VlJns-tPA to create a chimeric tPA-gpl20 gene encoding 
the tPA leader sequence and gpl20 without its native leader sequence. 
Ligation products were verified by restriction digestion and DNA 
sequencing. 
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II. IN VITRO gpl20 VACCINE EXPRESSION: 

In vitro expression was tested in transfected human rhabdomyosarcoma 
(RD) cells for these constructs. Quantitation of secreted tPA-gpl20 
from transfected RD cells showed that V IJns-tPA-gp 1 20 vector 
produced secreted gpl20. 

III. IN VIVO gpl20 VACCINATION: 

See figure 12 (mouse data): 

Anti-gpl20 ELISA Titers Elicited 
bv Secreted ppl2n * 

Species GMT (ranged 

mouse 5,310 (1.8 x 103- 1.5 x 10^) 

(post 2 rounds, 200|ig 

per round) 

rabbit 143 (75-212) 

(post 3 rounds, 2 mg 
per round) 

A.G. monkey 171 (< 10-420) 

(post 2 rounds, 
2 mg per round) 

Using VIJns-tPA-gpl20lliB as the inoculation vector, intramuscularly. 
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VlJns-tPA-gpl20M N PNV-induced Class II MHC- 
restricted T lymphocyte gpl20 specific antigen reactivities. Balb/c mice 
which had been vaccinated two times with 200 VlJns-tPA-gpI20MN 
were sacrificed and their spleens extracted for in vitro determinations 
of helper T lymphocyte reactivities to recombinant gpl20. T cell 
proliferation assays were performed with PBMC (peripheral blood 
mononuclear cells) using recombinant gpl20lliB (Repligen, catalogue 
#RP1016-20) at 5 |ig/ml with 4 x 105 cells/ml. Basal levels of 3h- 
thymidine uptake by these cells were obtained by culturing the cells in 
media alone, while maximum proliferation was induced using ConA 
stimulation at 2 |xg/ml. ConA-induced reactivities peak at ~3 days and 
were harvested at that time point with media control samples while 
antigen -treated samples were harvested at 5 days with an additional 
media control. Vaccinated mice responses were compared with naive, 
age-matched syngenic mice. ConA positive controls gave very high 
proliferation for both naive and immunized mice as expected. Very 
strong helper T cell memory responses were obtained by gpl20 
treatment in vaccinated mice while the naive mice did not respond (the 
threshold for specific reactivity is an stimulation index (SI) of >3-4; SI 
is calculated as the ratio of sample cpm/media cpm). Si's of 65 and 14 
were obtained for the vaccinated mice which compares with anti-gpl20 
ELISA titers of 5643 and 1 1 ,900, respectively, for these mice. 
Interestingly, for these two mice the higher responder for antibody gave 
significantly lower T cell reactivity than the mouse having the lower 
antibody titer. This experiment demonstrates that the secreted gpl20 
vector efficiently activates helper T cells in vivo as well as generates 
strong antibody responses. In addition, each of these immune responses 
was determined using antigen which was heterologous compared to that 
encoded by the inoculation PNV (IIIB vs. MN): 
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Splenic T Cell Proliferation Responses to rgp120 Following 
Vaccination with V1Jns-tPA-gp120M N 

Avg. CRM (Stimulation Index) 

5 

Mouse #(agp120 titer)3 Media'' ConA"! Media^ 

rgp1202 

#1 (naive; <10) 339 (1) 185,358 (546) 187 (1) 574 (3) 

10 

#2 (naive; <10) 237 (1) 229,775 (969) 283 (1) 51 1 
(1.8) 

#3 (immune; 5643) 317 (1) 221,003 (697) 354 (1) 
23,109 (65) 

15 

#4 (immune; 11,900) 229 (1)^ 243,427 (1063) 235 (1) 
3384 (14) 

"•Cells harvested on day 4 following 24 hr with 3H-thymidine. 
ConA was used at 2 |ag/ml concentration. 

20 

2Cells harvested on day 5 following 24 hr with ^H-thymidine. 
Recombinant gp120|||B was used at 5 |ig/mi concentration. 



3Anti-gp120(||B reciprocal endpoint ELISA titers and 
proliferation assays performed following 2 rounds of 200 \xg 
DNA/mouse (Balb/c). 

The foregoing data clearly demonstrates efficient in vivo 
expression of relevant HIV antigens with a polynucleotide vaccine 
antigen and elicitation of specific immune responses to the expressed 
gene product. This con.struct is easily modified to form a bi-cistronic 
PNV of this invention by including, dovmstream from the gpl20 
translation stop codon, an second or third cistron encoding REV, B7, 
gag or other antigens unrelated to HIV, such as influenza nucleoprotein 
or hemagglutinin encoding genes. 
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EXAMPLE 3 

gpl60 VACCmES 

In addition to secreted gpl20 constructs, we have prepared 
expression constructs for full-length, membrane-bound gpl60. The 
rationales for a gpl60 construct, in addition to gpl20, are (1) more 
epitopes are available both for both CTL stimulation as well as 
neutralizing antibody production including gp41, against which a potent 
HIV neutralizing monoclonal antibody (2F5, see above) is directed; (2) 
a more native protein structure may be obtained relative to virus- 
produced gpl60; and, (3) the success of membrane-bound influenza HA 
constructs for immunogenicity [Ulmer et al.. Science 259:1745-1749, 
1993; Montgomery, D., et aL. DNA and Cell BioL, 12:777-783, 1993]. 

gpl60 retains substantial REV dependence even with a 
heterologous leader peptide sequence. Therefore, two strategies 
independent from that employed for gpl20 expression were developed 
for preparing a gpl60 expression vector: (1) subcloning into VlJns a 
genomic HIV DNA fragment reported to be effective for heterologous 
gpl60 expression containing tat, REV and gpl60 in entirety (VlJns- 
tatlREVIenv ), [Wang et al., P.N.A.S. USA 90:4156-4160 (May, 1993); 
all of the data reported in that study were generated using bupivacaine 
injection about 24 hours prior to nucleic acid injection. As bupivicaine 
is knowoi to cause muscle damage, this is a regiment that clearly could 
not be used to immunize humans], and (2) PCR-cloning a minimal 
gpl60 ORF into a dicistronic vector before the EMCV internal 
ribosomal entry site (IRES) to efficiently reinitiate translation following 
gpl60 translation for a second cistron encoding REV, This constmct 
ensures effective simultaneous production of both gpl60 and REV 
proteins (VI Jns-gpl60/IRES/r(?v). Each of these vectors has been 
prepared in addition to the monocistronic vectors VlJns-gpl60 and 
VlJns-/?EK. Because there is evidence, in the literature and from our 
own experiments (see below), that the env mRNA requires the tatlREV 
splice donor (SD) site for stability in heterologous expression systems, 
VlJns-gpl60 and VlJns-gpl60/IRES//?£'V were also prepared with this 



wo 95/24485 



PCTAJS95/02633 



- 50 - 



SD inserted upstream of the env ORF. These vaccine constructs were 
prepared as follows. 

T crp 1 60 V AC r-TTsiF CONSTRUCTS: 

' Both gpl60 expression vectors, VlJns-gpl60 and VlJns- 

gpl60/IRES/rev (see A and B below) were prepared with the tat/rev 
splice donor (SD) inserted immediately upstream of gpl60 sequences at 
the PstI site within VlJns (this is the solitary PstI site within both of 
these vectors). Synthetic complementary oligomers encoding the SD 

^° were designed to ligate into the PstI site retaining the original site at the 
5'-end but destroying the PstI site at the 3 '-end of the insert after 
ligation. The oligomer sequences used were: 5'-GTC ACC GTC CTC 
TAT CAA AGC ACT AAG TAG TAG ATG CA-3', SEQ.ID:26: and 
5 -TGT ACT ACT TAG TGC TTT GAT AGA GGA CGG TGA CTG 
CA-3', SEO.ID:27:. The resulting plasmids were verified by restriction 
digestion mapping and by DNA sequencing across the entire SD/PstI 
region. 

A) V1Tn*;-HTVnThgpl60 : HlVmb gp 1 60 was cloned by PGR 
^° amplification from plasmid pF412 which contains the 3'-terminaI half 
of the HIVnib genome derived from HlVmb clone HXB2. The PGR 
sense and antisense oligomers were 5'-GGT AGA TGA TGA ACC ATG 
AGA GTG AAG GAG AAA TAT CAG C-3', SEQ. ID:28:, and 5'-CGA 
CAT TGA TGA GAT ATC CGG ATG TTA TAG CAA AAT CCT TTC 
C-3', SEQ. ID:29:, respectively. The Kozak sequence and translation 
stop codon are underlined. These oligomers provide Bell restriction 
enzyme sites outside of the translation open reading frame at both ends 
of the env gene. (Bcll-digested sites are compatible for hgation with 
Bglll -digested sites with subsequent loss of sensitivity to both restriction 
^° enzymes. Bell was chosen for PCR-cloning gp 160 because this gene 
contains intemal Bgin and as well as BamHI sites). The antisense 
oligomer also inserts an EcoRV site just prior to the Bell site as 
described above for other PCR-derived genes. The amplified gpl60' 
gene was agarose gel-purified, digested with Bell, and hgated to VlJns 
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which had been digested with Bglll and treated with calf intestinal 
alkaline phosphatase. The cloned gene was about 2.6 kb in size and each 
junction of gpl60 with VlJns was confirmed by DNA sequencing. 

B\ VlJns-HIVT TTh gpl60/IRES//?£\/ : pGEM-3-IRES//?E\/ was 
digested with HinDII and Smal restriction enzymes (contained within 
the pGEM-3 multi-linker region) to remove the entire IRES/REV 
sequence (-0.9 kb) and then ligated with VlJns-HIVnibgpl60 which 
had been digested with EcoRV and phosphatased. This procedure 
yielded an 8.3 kb dicistronic VlJns containing gpl60 followed by IRES 
and REV which directs expression of both of these HIV gene products. 
All of the junction regions were verified by DNA sequencing. 

C) VUns-tPA-HIVr rrR gpl60 : This vector is similar to Example 2(C) 
above, except that the full-length gpl60, without the native leader 
sequence, was obtained by PCR. The sense oligomer was the same as 
used in I.C. and the antisense oHgomer was 5'-CCA CAT TGA TCA 
GAT ATC CCC ATC TTA TAG CAA AAT CCT TTC C-3\ 
SEQ.ID:30:. These oligomers provide Bell sites at either end of the 
insert as well as an EcoRV just upstream of the Bell site at the 3*-end. 
The 5*-terminal Bell site allows ligation into the Bglll site of VlJns-tPA 
to create a chimeric tPA-gpl60 gene encoding the tPA leader sequence 
and gpl60 without its native leader sequence. Ligation products were 
verified by restriction digestion and DNA sequencing. 

D) y Uns-tat/rev/envlllB ' This expression vector is pattemed after one 
described by D. Rekosh et al. [Proc. Natl. Acad. Sci. USA, 85, 334 
(1988)] employing a "genomic" segment of an HIV-1 niB clone (HXB2) 
encompassing unspliced tat, rev, and env in their entirety. VlJns was 
digested with Bglll followed by T4 DNA polymerase blunting and calf 
intestinal alkaline phosphatase treatment. A Sall/Xhol fragment of the 
.IIIB genome contained within pF412 was obtained by restriction 
digestion and blunted with T4 DNA polymerase. Ligation products 
were verified by restriction digestion mapping and DNA sequencing. 
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E) VlJn.s-r(?v/c;7\'IIIB : This vector is a variation of the one described in 
section D above except that the entire tat coding region in exon 1 is 

^ deleted up to the beginning of the open reading frame. VlJns- 
gpl60lllB (see section A. above) was digested with PstI and Kpnl 
restriction enzymes to remove the 5'-region of the gpl60 gene. PGR 
ampHfication was used to obtain a DNA segment encoding the first/?£V 
exon up to the Kpnl site in gpl60 from the HXB2 genomic clone. The 
sense and antisense PGR oHgomers were 5 '-GOT AC A GTG GAG TGA 

^° CCG TGG T ATG GGA GGA AG A AGG GGA GAG-3', SEQ.ID:3 1 : 
and 5 -GGA GAT GA GGT AGG GGA TAA TAG AGT GTG AGG-3', 
SEO.ID:32: respectively. These oligomers provide PstI and Kpnl 
restriction enzyme sites at the 5'- and 3'- termini of the DNA fragment, 
respectively. The resulting DNA was digested with PstI and Kpnl, 
purified from an agarose electrophoretic gel, and ligated with VlJns- 
gpl60(PstI/KpnI). The resulting plasmid was verified by restriction 
enzyme digestion. 

II. IN VITRO EXPRESSION OF Ppl60 VAGCrNR: 

RD and 293 cells were transiently transfected with gpl60 
and REV expression constructs. A Western blot analysis shown in Fig. 
5 using an anti-gp41 monoclonal antibody (Ghessie 8, NIH AIDS 
Research and Reference Program #526) showed that gpl60 expression 
by VlJns-gpl60 (SD) required the addition of VlJns-REV (this vector 
produces > 1 |Lig REV/1 0^ cells in transient transfections). VlJns- 
gpl60/IRES/REV efficiently expressed gpl60 without additional REV 
added in trans, confirming function of the dicistron. Similar results 
were found with an anti-gpl20 monoclonal antibody (IGl, Repligen, 
#RP1010-10) for immunoblot visualization. Proteolytic processing of 
gpl60 to the mature gpl20 and gp41 forms was observed for each 
vector. Addition of REV in trans to the dicistronic gpl60/REV vector 
did not result in more gpl60 expression indicating that REV expression 
is not limiting for gpl60 expression in this vector. Expression of gpl60 
improved if the tat/REV SD was included within dicistronic gpl60/REV 
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construct indicating the importance of this site for optimal REV- 
dependent gpl60 expression. We were also surprised to discover that 
dicistronic gpl60/REV expressed more than ten-fold more gpl60 than 
the genomic tat/REV/env construct for transient transfections, again 
demonstrating the high efficiency of this vector for gpl60 expression. 

These vectors provide nucleic acid constructs for gpl60 
plasmid vaccinations with gpl60 and REV genes either on separate 
plasmids or on the same plasmid. In the case of the tpa-gpl60 
construct, REV need not be provided in cis or in trans to achieve 
efficient gpl60 expression, therefore allowing other genes to be 
incorporated in a dicistronic constmct. 

For the REV-dependent constructs, it is important to test 
whether effective gpl60 expression following vaccination requires REV 
to be present on the same plasmid because very small quantities of DNA 
are taken up by muscle cells following intramuscular injection, and 
individual muscle cells (each having hundreds of nuclei) may not 
receive copies of different plasmids in proximal locations within the 
cell. 

^ III, IN VIVO VACCINATION WITH gpl60 VACCINES: 

Three different vector strategies were compared for their 
abilities to induce anti-gpl20 antibody responses in nonhuman primates 
using PNVs encoding gpl60: vaccination with (1) dicistronic 
gpl60/REV using VlJns-gpl60niB/IRES/REV (SD); (2) the genomic 
gpl60 construct VlJns/tat/rev/envlUB; and (3) a mixture of 
monocistronic vectors, VlJns-gpl60TTTR (SD) and VlJns-REV. 
Vaccination doses of 2 mg/animal were used for up to three vaccination 
rounds which were delivered at one month intervals while 

^ simultaneously obtaining bleeds. Anti-gpl20 ELISA titers using 

recombinant gpl20lllB are shown for monkeys vaccinated with each of 
these vectors. Dicistronic gpl60/REV ehcited antibody responses in 
both rhesus and African Green monkeys while the genomic gpl60 and 
mixed monocistronic vectors did hot elicit detectable antibodies after 
two rounds of vaccination (i.e., one month following the second 
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vaccination). All four monkeys which received dicistronic gpl60/REV 
also showed specific anti-gp41 reactivities as measured by the BIAcore 
assay using recombinant gp41 (ABT) as the immobilized substrate (data 
not shown). The sera obtained from these monkeys also showed anti- 
V3IIIB ELISA reactivities with titers ranging from -^50 - 100. These 
results prove that in vivo expression induced by PNV for multiple 
cistrons is not analogous to results obtained by in vitro transfection 
methods in which gpl60 expression was shown for all three vector 
strategies. Note especially that in vitro transfection resulted in 
equivalent expression by the mixed monocistronic gpl60 and REV 
vectors as compared to dicistronic gpl60/REV (see Fig. 5). These 
experiments prove that our dicistronic PNVs do deliver effective 
coordinate expression following in vivo vaccination while other 
methods of vaccination with multiple cistrons were unable to do so. 
See figure 9, showing two African Green Monkeys and two rhesus 
monkeys and one rabbit's immune responses. 
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Anti-qp120 ELISA Titers Elicited bv qdISO PNVr 
in Non-Human Primates 
2 mg DNA per round 

^ Titer 

Vector/Species post 211^ post rild 

^/^^}r^s-tat/rev/env mp : 

African Green (#1) <20 ND"! 

10 

(#2) <20 ND 

(#3) <20 ^D 

V1Jns-gp1 60 iii £ + \/^Jns-rev ^: 

African Green (#1) <20 ND 

(#2) <20 IVD 

(#3) <20 ^D 

V1 Jns-gp160|j2 ^/IRES//-ev^ : 

African Green (#1) 85 9 0 

(#2) 75 60 

Rhesus (#1) 165 175 

(#2) 290 260 

^ND = not determined. 

2This PNV represents an equimolar mixture of the two monocistronic 
vectors. 
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Anti-V3|||p ELISA Titers Elicited bv gpieo/rei/ Dicistron * 
in Non-Human Primates. 2 ma DNA per round 

Species (animal #^ Titer (post 3l d . vaccination^ 

African Green (#1) 7 0 

(#2) 45 

Rhesus (#1) 55 
(#2) 100 

*Using V1Jns-gp1 60||iB/IRES/rev/ as the inoculation vector. 

EXAMPLE 4 

SIV VACCINES 

An SIV env construct, VlJn-SIV gpl52, was made by 
PCR-cloning from a genomic clone of the SIVMAC251 virus isolate and 
confirmed by DNA sequencing of both junctions with the vector. This 
strain is homologous to the virus which is used at the New England 
Regional Primate Center (NRPC) for infectious SIV challenges to 
rhesus monkeys. A similar SIV gpl52 construct is prepared in which 
the DNA encoding the leader peptide region uses alternative codons but 
which retains the native amino acid sequence. This reduces the REV- 
dependence of this construct and makes a more stable mRN A transcript. 
These vaccine constructs were prepared as follows. 

I. SIV VACCINE CONSTRUCTS: 

A). V1J-SIVMAC251 p28 gai^ : The central peptide of 
SIV gaii , referred to as p28 gag , was chosen for a polynucleotide 
vaccine to test for CTL generation in nonhuman primates. This region 
of gag encodes a known CTL epitope for macaque monkeys which 
have the MHC Class I haplotype known as Mamu-AOl. Thus, monkeys 
bearing this haplotype should demonstrate CTL reactivity this gag 
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epitope after vaccination with the appropriate ga}^ plasmid. While both 
SIV and HIV ^a<^ genes contain regulatory sequences which are REV 
dependent, p28 gag expression appears to be less REV -dependent so 
that at least some expression may be achieved in the absence of REV . 

SIV p28 gag was cloned into expression vectors V 1 using 
Bglll restriction enzyme sites after PCR amplification from the plasmid 
p239SpSp5* (obtained from the NIH AIDS Research and Reference 
Program, catalogue #829) using custom synthetic 
oligodeoxyribonucleotides. This plasmid encodes the 5'- half of the 
SIVMAC239 genome. SIVmAC 239 is a subsequent in vitro passage 
line of SIVMAC251 which has undergone some mutations compared to 
the parental virus. However, the amino acid sequences between these 
vimses are identical for p28 gag. The PCR sense and antisense 
oligomers were 5'-GGT ACA AGA TCT ACC ATG GGA CCA GTA 
CAA CAA ATA GGT GGT AAC-3' , SEQ. ID:33:, and 5'-CCA CAT 
AGA TCT TTA CAT TAA TCT AGC CTT CTG TCC C-3', SEQ. 
ID:34:. These oligos provide Bglll restriction enzyme sites outside the 
translational open frames, a consensus Kozak translation initiation codon 
context (underlined) and translation stop codon (underlined). PCR- 
generated p28 gag was agarose gel-purified, digested with Bglll and 
ligated into Bglll-treated, phosphatased VI. This gene was subsequently 
subcloned into our optimized expression vector, VIJ, using Bglll 
restriction enzyme sites and designated as VIJ-SIV p28 gag. The cloned 
gene was about 0.7 kb long. The junction sites of the VIJ CMV 
promoter and 5 'terminus of p28 gag were verified by DNA sequence 
analysis for each constmct. In vitro expression of SIV p28 protein was 
compared for VIJ and VI constmcts by Western blotting using plasma 
from an SIV-infected macaque monkey to detect gag protein. The VIJ- 
SIV p28 gag construct consistently gave the most product at the 
appropriate molecular weight position. Similar and even improved 
results are obtained with the more optimized Vljneo, VlJns and VI R 
vectors. 
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B). V1J-SIVMAC251 SIV /jc/ was cloned after PGR 

amplification from the plasmid pBK28 which encodes the entire 
SIVMAC251 genome (a gift from Dr. Vanessa Hirsch, NIAID, NIH, 
Rockville, MD; now listed as catalogue #133, NIH AIDS Research and 
Reference Program). The PGR sense and antisense oligomers were 5'- 
GGT AGA AGG ATG GGT GGA GGT ATT TGG ATG AGG-3', SEQ. 
ID:35: and 5'-GGT AGG TTA GGG TTG TTG TAA GGT GTT GG-3', 
SEQ. ID:36:. The Kozak site and translation stop codon are underlined. 
The amplified nef gene was agarose gel-purified, blunt-ended using T4 
DNA polymerase, phosphorylated at the 5'-terminus using T4 DNA 
kinase, and cloned into a blunted Bglll restriction enzyme site of VIJ 
which had been phosphatased using calf intestinal alkaline phosphatase. 
The cloned gene was about 0.76 kb long. The junction site of the VIJ 
GMV promoter and 5'-terminus of nef was confirmed by DNA 
sequencing. In vitro expression was shown using Western blot analysis 
and an HIV nef antiserum (catalogue #331, NIH AIDS Research and 
Reference Program). 

G). VlJn-SIVMAG251 gpl52: SIV env , referred to as 
gpl52, was cloned after PGR amplification from the plasmid pBK28 
into VlJneo (see Nucleic Acid Pharmaceuticals patent). The PGR 
sense and antisense oligomers were 5'-GGT AGA AGA TGT AGG ATG 
GGA TGT GTT GGG AAT GAG G-3', SEQ. ID:37: and 5'-GGA GAT 
AGA TGT GAT ATG GTA TGA GTG TAG TGG AAA TAA GAG G- 
3', SEQ.ID:38. The Kozak site and amber translation stop codon are 
underlined. The PGR product has Bglll restriction enzyme sites outside 
the translation open reading frame at both ends with an additional 
EcoRV site immediately preceding the 3'-terminal Bglll site but after 
the amber stop codon. This provides a convenient restriction enzyme 
site for subsequent cloning steps. The amplified gpl52 gene was 
agarose gel-purified, Bglll-digested and ligated with VlJn which had 
.been Bglll-digested and phosphatased. The cloned gene was about 2.2 
kb long. The junctions at each end of gpl52 with VlJn GMV promoter 
and BGH terminator regions were verified by DNA sequencing. 
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IL IN VIVO VACCINATION WITH SIV VACCINES: 

Two SIV gene constructs have been used for vaccination of 
rhesus monkeys and have been shown to generate specific CTL 
responses in non-human primates (see figure 4). 

VIJ-SIV p28 gag, which expresses the relatively REV- 
independent central peptide of gag, and VIJ-SIV nef were i.m.-injected 
into Macaca mulatta monkeys at 3mg/vaccination for three injection 
rounds spaced one month apart. The ,^a^'-specific CTL response of 
rhesus monkeys with the Mamu-AOl MHC I haplotype is restricted 
primarily to a single peptide epitope within p28 gag, Mamu-AOl + 
monkeys receiving VIJ-SIV p28 gag had ^a^^-specific CTL activity 
beginning at one month after the second injection while Mamu-AOl " 
monkeys receiving this DNA as well as monkeys receiving VI J control 
DNA did not show a CTL response. Both in vitro, gag peptide- 
restimulated CTL as well as primary CTL were detected after the 
second and third vaccination rounds, respectively. These CTL activity 
levels were comparable to those generated by vdccinra-gag inoculation. 
Subsequently, the CTL levels declined in responding animals. These 
animals are re-vaccinated to boost the initial CTL response. VIJ-SIV 
ne/*- vaccinated animals have not shown a specific CTL response, 
although a more refined assay, such as the one used for gag CTL 
detection, (i.e., no dominant MHC I haplotype/ne/ peptide relationship 
has been defined for rhesus monkeys so that peptides of unknown 
effectiveness are used for stimulation, and there is no positive control), 
may provide a different result, 

EXAMPLE 5 
OTHER VACCINE CONSTRUCTS 
° A. V 1 Jns-HI Vthr Qa 9/noI-RRE/lRES/R R V : Dicistronic 

expression vectors encoding gag with or without the protease region of 
pol were made by PCR amplification of HIVlUB gag-pol sequences with 
several variations. Inclusion of the protease (prt) segment of pol 
allows proteolytic processing of gag into various peptides (e.g., pl7. 
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p24, pl5, etc.) which comprise the mature capsid particles while 
omission of this enzyme results in p55 synthesis in the form of 
immature capsid particles. More extensive sequences of pol were not 
included to avoid potential safety hazards that may be associated with 
the reverse transcriptase and integrase enzymatic activities of pol. For 
^'fl^' capsid particles, whether processed into the mature forms or not, to 
be extruded from cells myristoylation of the glycine amino acid at 
position two following the initial Met must occur. Mutagenesis of this 
glycine residue abrogates myristoylation and no gaf> particles are 
extruded from the cell. These modifications of gai^ allow us to 
determine whether either generation of ds\X\-gag CTLs following 
vaccination with such gag vectors is affected by proteolytic processing 
and/or extrusion of capsid from cells. Some of the vectors listed below 
contain a splice donor (SD) site that is found upstream of the gag open 
reading frame. These vectors allow us to determine whether this SD is 
necessary for optimum rev-dependent expression of gag as was 
inclusion of the tat/rev SD for optimum gpl60 expression. 

n VlJns-HTVT Tm PgP-nr//RRF,/TRF..S/RF.V - Agag-prt 
encoding DNA segment was obtained by PGR amplification using the 
following sense and antisense oHgomers: 5'-GGT ACA GGA TCC ACC 
ATG GGT GCG AGA GCG TCA GTA TTA AGC-3', SEO.ID:39: and 
5 -CCA CAT GGA TCC GC CCG GGC TTA CAT CTC TGT ACA 
AAT TTC TAC TAA TGC-3', SEQ.ID:40:, respectively. These 
oligomers provide BamHI restriction enzyme sites at either end of the 
segment, a Kozak initiation of translation sequence including an Ncol 
site, and an Srfl site immediately upstream of the BamHI site at the 3'- 
terminus. The Srfl site was used to clone the RRE/IRES/REV cassette 
from pGEM-3-RRE/IRES/REV, which was excised using the EcoRV 
restriction enzyme, immediately downstream of gag-prt. All Hgation 
junctions were DNA sequence verified and the construct was further 
verified by restriction enzyme mapping. 

2) VlJns-HTVTTTR t/flrg-/7;Y/RRR/TRFS/REVrSD^ : This 
vector was prepared exactly as vector 1 above except that the PCR sense 
oligomer used was 5'-GGT ACA GGA TCC CCG CAC GGC AAG 
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AGG CGA GGG-3', SE0.ID:41:. This allows inclusion of the upstream 
SD site at the beginning of the ^ai^ sequence. This construct was 
verified by restriction enzyme mapping and DNA sequencing of the 
hgation junctions. 

3) VlJns-HIV TTTR g^g-pr//RRE/IRES/REV(w/o 
myristoylation) : This vector is prepared exactly as vector 1 above 
except that the PGR sense oligomer used was 5*-GGT ACA GGA TCC 
ACC ATG GCT GCG AG A GCG TCA GTA TTA AGC-3', SEQJD:42. 

4) VlJns-HIVT TTR g^g/RRE/IRES/REV : This vector is 
0 

prepared exactly as vector 1 above except that the PGR antisense 

oligomer used was 5''GGA GAT GGA TGC GGG GGG GGG TTT ATT 
GTG AGG AGG GGT GGT TGG-3\ SEO.ID:43. 

5) VlJns-HIVT Tm i^/RRE/IRES/RE V TSD^ : This vector 
is prepared exactly as vector 4 above except that the PGR sense 
oligomer used was 5'-GGT AGA GGA TGG GGG GAG GGG AAG 
AGG GGA GGG-3\ SEQ.ID:44. 

6) VlJns-HIVT rm e^g/RRE/IRES/REV (w/o 
myristoyiation') : This vector is prepared exactly like vector 5 except 
that the PGR sense oligomer used was 5*-GGT AGA GGA TGG AGG 

° ATG GGT GGG AGA GGG TGA GTA TTA AGG-3\ SEQ.ID:45. 

B. VlJns-HIV nef : This vector uses a n<?/ gene from a 
viral strain representative of those in the infected population using sense 
^ and antisense PGR oligomers analogous to those used for SIV nef. 

G. pGEM-3-X-IRES-B7 : (where X = any antigenic gene) 
As an example of a dicistronic vaccine constmct which provides 
coordinate expression of a gene encoding an immunogen and a gene 
^ encoding an immunostimulatory protein, the murine B7 gene was PGR 
amplified from the B lymphoma cell line GHl (obtained from the 
ATGG). B7 is a member of a family of proteins which provide essential 
postimulation T cell activation by antigen in the context of major 
histocompatibility complexes I and II. GHl cells provide a good source 
of B7 mRNA because they have the phenotype of being constitutively 
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activated and B7 is expressed primarily by activated antigen presenting 
cells such as B cells and macrophages. These cells were further 
stimulated in vitro using cAMP or IL-4 and mRNA prepared using 
standard guanidinium thiocyanate procedures. cDNA synthesis was 
^ performed using this mRNA using the GeneAmp RNA PGR kit (Perkin 
-Elmer Getus) and a priming oligomer (5'-GTA GCT GAT GAG GGA 
GAT AAT AGG ATG-3\ SE0.ID:46:) specific for 37 located 
downstream of the B7 translational open reading frame. B7 was 
amplified by PGR using the following sense and antisense PGR 
oligomers: 5'-GGT AGA AGA TGT AGG ATG GGT TGG AAT TGT 
GAG TTG ATG G-3', SE0.ID:47:, and 5'-GGA GAT AGA TGT GGA 
TGG GAA GTA A AG GAA GAG GGT GTG TTG-3', 
SEO.ID:48:, respectively. These oligomers provide Bglll restriction 
enzyme sites at the ends of the insert as well as a Kozak translation 

15 

initiation sequence containing an Ncol restriction site and an additional 
Ncol site located immediately prior to the 3'-terminal Bglll site, Ncol 
digestion yielded a fragment suitable for cloning into pGEM-3-IRES 
which had been digested with Ncol. The resulting vector, pGEM-3- 
IRES-B7, contains an IRES-B7 cassette which can easily be transferred 

20 

to VlJns-X, where X represents an antigen-encoding gene. 

D. pGEM-3-X-IRES-GM-GSF : (where X = any antigenic 
gene) This vector contains a cassette analogous to that described in item 
G above except that the gene for the immunostimulatory cytokine, GM- 
GSF, is used rather than B7. GM-GSF is a macrophage differentiation 
and stimulation cytokine which has been shown to elicit potent anti- 
tumor T cell activities in vivo [G. Dranoff et al., Proc. Natl. Acad. Sci. 
USA, 90, 3539 (1993). 

E. pGEM-3-X-IRES-IL-12 : (where X = any antigenic 
gene) This vector contains a cassette analogous to that described in item 
G above except that the gene for the immunostimulatory cytokine, IL- 
12, is used rather than B7. IL-12 has been demonstrated to have an 
influential role in shifting immune responses towards cellular, T cell- 
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dominated pathways as opposed to humoral responses [L. Alfonso et ah. 
Science, 263, 235, 1994]. 

F. VlJns-HIVv gpl60/IRES/rgvT TrR (SD) : This vector is 
analogous to the one described in LB. above except that gpl60 genes 
derived from various clinical strains are used rather than gpl60 derived 
from laboratory strain IIIB. 

Ck VlJns-PR8/34/HA-IRES-SIVp28 gag 
This construct provides an influenza hemagglutination gene 
(HA) in concert with the SIV p28 ^ag gene for coordinate expression 
via the IRES element. The PR8/34/HA gene was amplified by PGR 
using the following sense and antisense oligomers: 5'-GGT AC A AG A 
TCT ACC ATG AAG GCA AAC CTA CTG GTC CTG-3', 
SEO.ID:49:, and 5'-CCA CAT AGA TCT GAT ATC CTA ATC TCA 
GAT GCA TAT TCT GCA CTG C-3', SEQ.ID:50:, respectively. The 
resulting DNA segment has Bglll restriction enzyme sites at either end 
. and an EcoRV site at the 3'-terminus. After Bglll digestion this gene 
was cloned into VlJns which had been digested with Bglll followed by 
alkaline phosphatase treatment, SIV p28 gag was excised from VIJ-SIV 
p28 gag by Ncol and Bglll digestion. pGEM-IRES was digested with 
Ncol and BamHI for directional ligation with p28 ^a^j/NcoI/Bglll. The 
IRES-p28 gag cassette is removed by restriction digestion with Smal 
and Hindu and ligated into the EcoRV site of VlJns-A/PR8/HA. In 
vivo coordinate expression of these genes allows generation of potent 
antibody responses by PNV vaccination (HA), with requisite T cell help, 
which provides such help in a local environment to potentiate the CTL 
response of the second gene product (SIV p28 gag). This construct also 
demonstrates the ability to use the PNV and method of this invention to 
generate immune responses against multiple antigens whether or not 
related to HIV. Those skilled in the art will appreciate that this type of 
construct could be mixed with other, bi- or tri-cistronic constmcts to 
produce a multivalent combination polynucleotide vaccine. 
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H. VlJns-tPA-gpl6Q mR /IRES/SIV p2R ^gq : VlJns-tPA- 
gpl60llIB was digested with EcoRV, treated with calf intestinal alkaline 
phosphatased, and ligated with IRES-SIV p28 gag which had been 
removed from pGEM-3-IRES-SIV p28 by restriction enzyme digestion 
using Smal and Hindll. In vivo coordinate expression of these genes 
allows coupling a protein which generates strong helper T cell responses 
(gpl60) to one which provides Class I MHC-associated CTL epitopes 
(SIV p28 gag). This vaccine is designed for immunization of rhesus 
monkeys for generation of anti-^'nv neutralizing antibodies and CTL as 
well as anti-SIV gag CTL. These monkeys are subsequently challenged 
with appropriate SHIV viral challenges [see J. Li et al., J. A.LD.S. 5. 
639-646 (1992)]. 

L VlJns-tPA-gpl20 TTrR /IRES/SIV p28 9av : This vector 
is constructed exactly as VlJns-tPA-gpl6Qn TR /IRES/SIV p28 gai^ 
except that VI Jns-tPA-gpl20lllB is used in place of the gpl60 gene. 
Vaccination and SHIV challenge are conducted as described above. 

J. VlJns-tPA-gpl20T TTR /IRES/HIVg^i^ /IRES//-^v : This 
vector is similar to those described above except that a tricistron 
provides gag and rev expression in addition to gpl20. 

K. VlJns-tPA-gpl60T TTR /IRES/HIV mJESIrev : This 
vector is similar to those described above except that a tricistron 
provides gag and rev expression in addition to gpl60. 

EXAMPLE 6 

ASSAY FOR HIV CYTOTOXIC T-LYMPHOCYTES: 

The methods described in this section illustrate the assay as 
used for vaccinated mice. An essentially similar assay can be used with 
primates except that autologous B cell lines must be established for use 
as target cells for each animal. This can be accomplished for humans 
using the Epstein-Barr vims and for rhesus monkey using the herpes B 
virus. 
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Peripheral blood mononuclear cells (PBMC) are derived 
from either freshly drawn blood or spleen using Ficoll-Hypaque 
centrifugation to separate erythrocytes from white blood cells. For 
mice, lymph nodes may be used as well. Effecter CTLs may be 
prepared from the PBMC either by in vitro culture in IL-2 (20 U/ml) 
and concanavalin A (2|Lig/ml) for 6-12 days or by using specific antigen 
using an equal number of irradiated antigen presenting cells. Specific 
antigen can consist of either synthetic peptides (9-15 amino acids 
usually) that are known epitopes for CTL recognition for the MHC 
haplotype of the animals used, or vaccinia vims constmcts engineered to 
express appropriate antigen. Target cells may be either syngenic or 
MHC haplotype-matched cell lines which have been treated to present 
appropriate antigen as described for in vitro stimulation of the CTLs. 
For Balb/c mice the PI 8 peptide 

(ArglleHisIleGlyProGlyArgAlaPheTyrThrThrLysAsn, SEQ.ID:51:, for 
HIV MN strain) can be used at 10 \xM concentration to restimulate CTL 
in vitro using irradiated syngenic splenocytes and can be used to 
sensitize target cells during the cytotoxicity assay at 1-10 |llM by 
incubation at 37^C for about two hours prior to the assay. For these 
H-2d MHC haplotype mice, the murine mastocytoma cell line, P815, 
provides good target cells. Antigen-sensitized target cells are loaded 
with Na51Cr04, which is released from the interior of the target cells 
upon killing by CTL, by incubation of targets for 1-2 hours at 37^C 
(0.2 mCi for -5 x 10^ cells) followed by several washings of the target 
cells. CTL populations are mixed with target cells at varying ratios of 
effectors to targets such as 100:1, 50:1, 25:1, etc., pelleted together, and 
incubated 4-6 hours at 37^C before harvest of the supematants which 
are then assayed for release of radioactivity using a gamma counter. 
Cytotoxicity is calculated as a percentage of total releasable counts from 
the target cells (obtained using 0.2% Triton X-100 treatment) from 
which spontaneous release from target cells has been subtracted. 
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EXAMPLE 7 
ASSAY FOR HIV SPECIFIC ANTIBODIES: 

ELISAs were designed to detect antibodies generated 
against HIV using either specific recombinant protein or synthetic 
peptides as substrate antigens. 96 well microtiter plates were coated at 
40c overnight with recombinant antigen at 2 |ig/ml in PBS (phosphate 
buffered saline) solution using 50 jxl/well on a rocking platform. 
Antigens consisted of either recombinant protein (gpl20, rev: Repligen 
Corp.; gpl60, gp41: American Bio-Technologies, Inc.) or .synthetic 
peptide (V3 peptide corresponding to virus isolate sequences from IIIB, 
etc.: American Bio-Technologies, Inc.; gp41 epitope for monoclonal 
antibody 2F5). Plates were rinsed four times using wash buffer 
(PBS/0.05% Tween 20) followed by addition of 200|al/well of blocking 
buffer (1% Carnation milk solution in PBS/0.05% Tween-20) for 1 hr 
at room temperature with rocking. Pre-sera and immune sera were 
diluted in blocking buffer at the desired range of dilutions and 100 |il 
added per well. Plates were incubated for 1 hr at room temperature 
with rocking and then washed four times with wash buffer. Secondary 
antibodies conjugated with horse radish peroxidase, (anti-rhesus Ig, 
Southern Biotechnology Associates; anti- mouse and anti-rabbit Igs, 
Jackson Immuno Research) diluted 1 :2000 in blocking buffer, were then 
added to each sample at 100 )Lil/well and incubated 1 hr at room 
temperature with rocking. Plates were washed 4 times with wash buffer 
and then developed by addition of 100 |j.l/well of an o-phenylenediamine 
(o-PD, Calbiochem) solution at 1 mg/ml in 100 mM citrate buffer at pH 
4.5. Plates were read for absorbance at 450 nm both kinetically (first 
ten minutes of reaction) and at 10 and 30 minute endpoints (Thermo- 
max microplate reader, Molecular Devices). 

EXAMPLE R 
ASSAY FOR HIV NEUTRALIZING ANTIBODIES: 

In vitro neutralization of HIV isolates assays using sera 
derived from vaccinated animals was performed as follows. Test sera 
and pre-immune sera were heat inactivated at 560c for 60 min before 
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use. A titrated amount of HIV-1 was added in 1:2 serial dilutions of test 
sera and incubated 60 min at room temperature before addition to 10^ 
MT-4 human lymphoid cells in 96 well microliter plates. The virus/cell 
mixtures were incubated for 7 days at 37^C and assayed for virus- 
mediated killing of cells by staining cultures with tetrazolium dye. 
Neutralization of virus is observed by prevention of virus-mediated cell 
death. 



EXAMPLE 9 

PROTECTION OF CHIMPANZEES UPON CHALLENGE WITH 
VIRULENT HIV-1: 

The only animal HIV challenge model to date is with 
chimpanzees. While chimpanzees do not develop HIV-related 
immunodeficiency disease they can be infected with some HIV viral 
isolates. The most common strain used to date in this model is the IIIB 
strain (BHIO) although challenge stocks for other isolates are being 
developed, e.g., for SF2. We envision vaccination of chimpanzees in an 
analogous manner to vaccination in other nonhuman primates using HIV 
env and gag-pol constructs derived from the HIV-1 IIIB strain (HXB2 
clone) as described within this document to achieve anti-HIV humoral 
and cellular responses. While the BHIO challenge virus for 
chimpanzees is IIIB derived as are our vaccination construct genes, 
there is heterogeneity within this virus so that HXB2 is only one of at 
least three variations of IIIB present in the viral inoculum. Thus, the 
IIIB challenge experiment of HXB2 gene vaccinated monkeys is not 
completely homologous. 

We are vaccinating chimpanzees 3-5 rounds with 
polynucleotide HIV gene vaccines with doses of 0. 1 -3 mg of 
plasmid/round. After characterization of vaccine-induced humoral and 

30 

CTL anti-HIV responses these monkeys are challenged with 10 to 140 
CID50 (50% chimpanzee infectious dose) by an intravenous 
administration of HIV-lniB inoculum diluted 1:25 in physiologic saline 
just prior to use. Infection of chimpanzees is monitored by detection of 
HIV-1 virus specific DNA sequences using DNA derived from PBMC 
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obtained from test chimpanzees, (see Example 10 for details). Vaccine- 
mediated protection can be described as a range of responses to 
challenge virus from complete sterilizing immunity (inability to detect 
virus post infection) to significant reductions and/or delay in viremia 
induced by the challenge stock. While sterilizing immunity is clearly the 
most preferred response to vaccination, reduced or delayed viremia 
may significantly influence onset of immunodeficiency disease in human 
vaccinees. 

EXAMPLE 10 

ISOLATION OF GENES FROM CLINICAL HIV ISOLATES: 

HIV viral genes were cloned from infected PBMC's which 
had been activated by ConA treatment. The preferred method for 
obtaining the viral genes was by PCR amplification from infected 
cellular genome using specific oligomers flanking the desired genes. A 
second method for obtaining viral genes was by purification of viral 
RNA from the supematants of infected cells and preparing cDNA from 
this material with subsequent PCR. This method was very analogous to 
that described above for cloning of the murine B7 gene except for the 
PCR oligomers used and random hexamers used to make cDNA rather 
than specific priming oligomers • 

Genomic DNA was purified from infected cell pellets by 
lysis in STE solution (10 mM NaCU 10 mM EDTA, 10 mM Tris-HCl, 
pH 8.0) to which Proteinase K and SDS were added to 0.1 mg/ml and 
0.5% final concentrations, respectively. This mixture was incubated 
overnight at 56^C and extracted with 0.5 volumes of 
phenol :chloroform:isoamyl alcohol (25:24:1). The aqueous phase was 
then precipitated by addition of sodium acetate to 0.3 M final 
concentration and two volumes of cold ethanol. After pelleting the 
DNA from solution the DNA was resuspended in 0.1 X TE solution (IX 
TE = 10 mM Tris-HCl, pH 8.0, 1 mM EDTA). At this point SDS was 
added to 0. 1 % with 2 U of RNAse A with incubation for 30 minutes at 
370c. This solution was extracted with phenol/chloroform/isoamyl 
alcohol and then precipitated with ethanol as before. DNA was 
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suspended in 0.1 X TE and quantitated by measuring its ultraviolet 
absorbance at 260 nm. Samples were stored at -2iPC until used for 
PGR. 

PGR was performed using the Perkin-Elmer Getus kit and 
procedure using the following sense and antisense oligomers for gpl60: 
5'-GA AAG AGG AGA AGA GAG TGG GAA TGA -3\ SEQ.ID:52: 
and 5 -GGG GTT TGG TAA ATG GGT GGG AAG TGG GGG GGG C 
ATG TGG-3', SEQ.ID:53:, respectively. These oligomers add an Srfl 
site at the 3*-terminus of the resulting DNA fragment. PGR-derived 
segments are cloned into either the VlJns or VIR vaccination vectors 
and V3 regions as well as ligation junction sites confirmed by DNA 
sequencing. 
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EXAMPLE 1 1 

SEQUENCES ACROSS VACCINE CONSTRUCT JUNCTIONS: 
Genes were cloned according to Example 10. In each case, the junction 
sequences from the 5* promoter region (CMVintA) into the cloned gene 
was sequenced using the primer: 

GMVinta primer 5 - CTA AGA GAG TGT TGG TTT CCA TG- 3', 
SEQ. ID:54:, which generates the sequence of the coding sequence. 
This is contiguous with the terminator/coding sequence, the junction of 
which is also shown. This sequence was generated using the primer: 
BGH primer 5'- GGA GTG GCA GGT TGG AGG -3\ SEQ. ID:55:, 
which generates the sequence of the non-coding strand. In every case, 
the sequence was checked against known sequences from GENBANK 
for cloned and sequenced genes from these or other HIV isolates. The 
position at which the junction occurs is demarcated by a which does 
not represent any discontinuity in the sequence. The first "ATG" 
encountered in each sequence is the translation initiation codon for the 
respective cloned gene. Each sequence provided represents a complete, 
available, expressible DNA construct for the designated HIV gene. The 
nomenclature follows the convention: "Vector name-HIV strain-gene". 
The biological efficacy of each of these constructs is shown in the same 
manner as in the foregoing Examples: 
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SEQUENCE ACROSS THE 5' JUNCTIONS OF CMVintA AND THE 
HIV GENES AND ACROSS THE 3' JUNCTIONS OF THE HIV 
GENES AND THE BGH TERMINATOR EXPRESSION 
^ CONSTRUCTS, USING DIFFERENT HIV STRAINS AND 
PROTEINS: 

1 VlIns-rgv iiiB: 

SEO.ID:56: 

5 -GGA GAG AGC GACGAA GAG CTG CTC AAG GCA GTC AGA CTC ATC AAG-3' 
rev .... 

(Sequence begins at the 5 - terminus within the PCR oligomer. See #7 below 
for complete rev 5'- terminus sequence) 

SEO.ID:57: 

5'-GAT GGC TGG CAA CTA GAA GGC ACA GCA GAT CT/ GAT ATC GCA CTA 
BGH rev.., 

TTC TTT AGC TCC TGA CTC CAA TAT TGT-3' 
2. V1.lns-Pn160iTfK 

SEO.ID:58: 

5'-CTT AGA TCI A ACC ATG AGA GTG AAG GA GAA ATA TCA GCA CTT GTG 
CMVinta gpl60 

GAG ATG GGG GTG GAG ATG GGG CAC CAT GCT CCT TGG GAT GTT GAT GAT 
CTG TAG TGC TAC AGA AAA ATT GTG GGT-3' 

SEQ.ID:59: 

5 -CTG GCA ACT AGA AGG CAC AGC AGA TC/ A GAT ACT GTC CCC ATC TTA 
BGH gpl60 

TAG CAA AAT CCT TTC CAA GCC CTG TCT TAT TCT-3' 
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3. pGEM-3-rRES: [sequenced using SP6 (5'-GAT TTA GGT GAC ACT 
ATA G-3', SEQ.ID:60:) and T7 (5'-TAA TAG GAC TCA CTA TAG GG-3', 
SEQ.ID:61:) primers, Promega Biotech] 

SEO.ID:62: 

5'-CAT GCC TGC AGG TCG ACT CTA/ AAT TCC G... 
pGEM-3 (SP6) IRES 

SEO.ID:63: 

5 - A CCC GGG GAT CCT CT/ A GCG CGC TTG TCT CTT GTT CCA... 
pGEM-3 (T7) IRES 

4. DGEM-3-IRES/rgv iirB . [sequenced using T7 sequencing primer 
(Promega) for rev 3'-end, and] IRES 3'- oligomer (5'-GG GAC GTG GTT 
TTC C-3', SEO.ID:64:) for IRES/r^v junction] 

SEQ.ID:65: 

5 -TAT GGC CAC AAC C/ AT GGC AGG AAG AAG CGG AGA CAG CGA CGA AGA 

IRES rev 
CCT CCT CAA GGC AGT CAG ACT -3' 

SE0.ID:66: 

5'-CTC GAG CCA TGG GCC CCT/ AGA CTA TAG CGT GAT AAG AAA TCG AGG 
pGEM-3 rev 

ACT GAG GTT ATA ACA TCC TCT AAG GTG GTT ATA AAC TCC CGA AGG-3' 

5- pGEM-3-RRE/IRES/rgv inB : [using SP6 sequencing oligomer 
(Promega) and IRES 5 - oligomer, 5'-G CTT CGG CCA GTA ACG-3', 
SEO.ID:67:] 

SEO.ID:68: 

5 -TTG CAT GCC TGC AGG T/ GGT ACA TGA TCA GAT ATC G CCC GGG / C 
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pGEM-3 RRE 

CGA GAT CTT CAG ACT TGG AGG AGG AGA TAT GAG GGA CAA TTG GAG-3' 
IRES-5' 

SEO.ID:69: 

5'-GGG GCG GAA TT/ T AGA GTC A/ ATT GAT CAG CTT GTG TAA TTG TTA 

RRE-3' 

ATT TCT CTG TCC CAC TCC ATC CAG GTC GTG TGA TTC...-3' 

6. V I. }ns-( tat/rev SO): [used for VI Jns-gpl60lIIB/IRES//-cvIllB (SD) and 

VlJns-gpl60lIIB(SD); sequenced using an oligomer complementary to 
gpI60 reading towards 5'-end of gpl60 and into CMVintA: 5-CCA 
TCT CCA CAA GTG CTG-3', SEQ.ID:70:] 

SE0.ID:71: 

5 -AGA TCT A AGG ACG GTG ACT GCA / TGT ACT ACT TAC TGC TTT GAT 
CMVintA tat/rev SD 

AGA GGA CGG TGA / CTG CAG AAA AGA CCC ATG GAA A-3' 

CMVintA 

7. Vl.Tns-?p160 inB/IRES/rgvTITR (SD) : [gpl60ARES junction 
sequenced using IRES 5'- oligomer, 5'-G CTT CGG CCA GTA ACG-3', 
SEQ.ID:72:] 

SEO.ID:73: 

5 -GGC ACA GCA GAT C/ AG ATG GGG ATC TGA TA TCG CAC TAT TCT TTA 
BGH rev 

GCT CCT GAC TCC TGA CTC-3' 



SEO.ID:74: 
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5'-GGA ATT/ TGA GTC ATC / CCC ATC TTA TAG CAA AAT CCT TTC CAA -3' 
IRES gpl60 

8. V l Jns-ggg-pmilB (SD): 

5 

SEO.ID:75: 

5'-CTT AGA TC/ C CCG CAC GGC AAG AGG CGA GGG GCG GCG ACT GGT-3' 
CMVintA gag (SD) 

^° SEO.ID:76: 

5 -GGC ACA GCA GAT C/ CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 
BGH prt 

TAA TGC TTT TAT TTT TCT TCT GTC...-3' 

15 

9. vi.ins-ggg-p miiB : 

SEO.ID:77: 

5'-CTT AGA TC/ CAC CAT GGG TGC GAG AGC GTC AGT ATT AA GCG GGG 
^° CMVintA gag 

GGA GAA TTA GAT CGA TGG GAA AAA ATT...-3" 

SEQ.ID:78: 

^ ^ 5'-GGC ACA GCA GAT C/ CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 
BGH prt 

TAA TGC TTT TAT TTT TCT TCT GTC...-3' 

30 

10. V1.Ins-tPA: 



SEO.ID:79: 

5*-TCA CCG TCC TTA GAT C/ ACC ATG GAT GCA ATG AAG AGA GGG CTC TGC 
CMVintA tPA leader 
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TGT GTG CTG CTG CTG TGT GGA GCA GTC TTC GTT TCG CCC AGC GA/ G ATC 
BGH 

5 

TGC TGT GCC TTC TAG TTG CCA GCC-3' 
11. V1.Ins-tPA-gnl20M1M : 

^° SEO-ID:80: 

5'-TTC GTT TCG CCC AGC GA/ TCA CAG AAA AAT TGT GGG TCA CAG TC-3' 
tPA gpl20MN 

SEQ.ID:81: 

15 

5 -GGC ACA GCA GAT CI CAC GTG TTA GCG CTT TTC TCT CTC CAC CAC-3' 
BGH gpl20MN 



^ 12. V1.I-SIV MAC251 n28 gflg 
SEO.ID:82: 

5 -TCA CCG TCC TTA GAT CT/ ACC ATG GGA CCA GTA CAA CAA ATA GGT 

CMVintA p28 gag,.. 

GGT AAC TAT GTC CAC CTG CCA TTA AGC CCG AGA ACA-3' 

5 

SEO.ID:83: 

5'-GGC ACA GCA GAT CT/ TTA CAT TAA TCT AGC CTT CTG TCC CGG TCC-3' 
BGH p28 gag 

° 13. VI I-SIV]vtAr251^^^ 



SEO.ID:84: 

5'-TCA CCG TCC TTA GAT C/ GGT ACA ACC ATG GGT GGA GCT ATT TCC ATG 
CMVintA nef 
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AGG CAA TCC A AG CCG GCT GGA GAT CTG AC A GAA A-3' 
SEQ.ID:85: 

5'-GGC ACA GCA GAT CA/ C CTA GGT TAG CCT TCT TCT AAC CTC TTC CTC 
^ BGH nef..., 

TGA CAG GCC TGA CTT GCT TCC AAC TCT TCT GGG TAT CTA G-3' 

14. W^ns-tatlrevlenv : 

^° SEQ.ID:86: 

5'-ACC GTC CTT AGA T/ TC GAC ATA GCA GAA TAG GCG TTA CTC GAC AGA 
CMVintA tatlrevlenv 

GGA GAG CAA GAA ATG GAG CCA GTA GAT CCT AGA CTA GAG CCC TGG-3' 

15 

SEQ.ID:87: 

5'-GGC ACA GCA GAT C/ C GAG ATG CTG CTC CCA CCC CAT CTG CTG-3' 
BGH tatlrevlenv 

^° EXAMPLE 12 

T CELL PROLIFERATION ASSAYS : 

PBMCs can be obtained as described in Example 6 from 
above and tested for recall responses to specific antigen as determined 
by proliferation within the PBMC population. Proliferation is 
monitored using ^H-thymidine which is added to the cell cultures for 
the last 18-24 hours of incubation before harvest. Cell harvesters retain 
isotope-containing DNA on filters if proliferation has occurred while 
quiescent cells do not incorporate the isotope which is not retained on 
the filter in free form. For either rodent or primate species 4 X 1 0^ 

^° cells are plated in 96 well microliter plates in a total of 200 \x.\ of 
complete media (RPMI/10% fetal calf serum). Background 
proliferation responses are determined using PBMCs and media alone 
while nonspecific responses are generated by using lectins such as 
phytohaemagglutin (PHA) or concanavalin A (Con A) at 1 - 5 |ig/ml 
concentrations to serve as a positive control. Specific antigen consists of 
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either known peptide epitopes, purified protein, or inactivated virus. 
Antigen concentrations range fronn 1-10 |iM for peptides and 1-10 
|Lig/ml for protein. Lectin-induced proliferation peaks at 3-5 days of 
cell culture incubation while antigen-specific responses peak at 5-7 days. 
Specific proliferation occurs when radiation counts are obtained which 
are at least three-fold over the media background and is often given as a 
ratio to background, or Stimulation Index (SI). HIV gpl60 is known to 
contain several peptides known to cause T cell proliferation of 
gpl60/gpl20 immunized or HIV-infected individuals. The most 
commonly used of these are: Tl 

(LysGlnllelleAsnMetTrpGlnGluValGlyLysAlaMetTyrAla, 
SEQ.ID:88:); T2 (HisGluAspIlelleSerLeuTrpAspGlnSerLeuLys, 
SEQ.ID:89:); and, TH4 

(AspArgVallleGluValValGlnGlyAalTyrArgAlalleArg, SEQ.ID:90:). 
These peptides have been demonstrated to stimulate proliferation of 
PBMC from antigen-sensitized mice, nonhuman primates, and humans. 

REFERENCES : 

L. Arthur et ah, J. Virol. 63, 5046 (1989). [chimp/HIV challenge 

model/virus neut. assay] 
T. Maniatis et al., Molec. cloning: a lab. manual, p. 280 Cold Spring 
Harbor 

Lab., CSH, NY (1982) [genomic DNA purif.] 
E. Emini et ah, J. Virol. 64, 3674 (1990) [chimp challenge, neut assay] 

EXAMPLE 13 

Vector VIR Preparation 

In an effort to continue to optimize our basic vaccination 
vector, we prepared a derivative of VI Jns which was designated as 
VIR. The purpose for this vector construction was to obtain a 
.minimum-sized vaccine vector, i.e., without unnecessary DNA 
sequences, which still retained the overall optimized heterologous gene 
expression characteristics and high plasmid yields that VIJ and VI Jns 
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afford. We determined from the literature as well as by experiment 
that (1) regions within the pUC backbone comprising the E. coli origin 
of replication could be removed without affecting plasmid yield from 
bacteria; (2) the 3'-region of the kan^ gene following the kanamycin 
open reading frame could be removed if a bacterial terminator was 
inserted in its stead; and, (3) -300 bp from the 3'- half of the BGH 
terminator could be removed without affecting its regulatory function 
(following the original Kpnl restriction enzyme site within the BGH 
element). 

VI R was constructed by using PGR to synthesize three 
segments of DNA from VlJns representing the CMVintA 
promoter/BGH terminator, origin of repHcation, and kanamycin 
resistance elements, respectively. Restriction enzymes unique for each 
segment were added to each segment end using the PGR oligomers: 
Sspl and Xhol for GMVintA/BGH; EcoRV and BamHI for the kan r 
gene; and. Bell and Sail for the ori r. These enzyme sites were chosen 
because they allow directional hgation of each of the PGR-derived DNA 
segments with subsequent loss of each site: EcoRV and Sspl leave blunt- 
ended DNAs which are compatible for ligation while BamHI and BcII 
leave complementary overhangs as do Sail and Xhol. After obtaining 
these segments by PGR each segment was digested with the appropriate 
restriction enzymes indicated above and then ligated together in a single 
reaction mixture containing all three DNA segments. The 5'-end of the 
ori r was designed to include the T2 rho independent terminator 
sequence that is normally found in this region so that it could provide 
termination information for the kanamycin resistance gene. The hgated 
product was confirmed by restriction enzyme digestion (>8 enzymes) as 
well as by DNA sequencing of the ligation junctions. DNA plasmid 
yields and heterologous expression using viral genes within VIR appear 
similar to VlJns. The net reduction in vector size achieved was 1346 bp 
(VlJns = 4.86 kb; VIR = 3.52 kb), see figure 1 1, SEQ.ID:45:. 

PGR oligomer sequences used to synthesize VIR (restriction enzyme 
sites are underlined and identified in brackets following sequence): 
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(1 ) 5'-GGT ACA AAT ATT GG CTA TTG GCC ATT GCA TAG G-3' [Sspl], 
SEQ.ID:91:, 

(2) 5'-CCA CAT CTC GAG GAA CCG GGT CAA TTC TTC AGC ACC-3' [Xhol], 
SEQ.ID:92: 

(for CMVintA/BGH segment) 

(3) 5'-GGT ACA GAT ATC GGA AAG CCA CGT TGT GTC TCA AAA TC- 
3'[EcoRV], SEQ.ID:93: 

(4) 5'-CCA CAT GGA TCC G TAA TGC TCT GCC AGT GTT ACA ACC-3' 
[BamHI], SEQ.ID:94: 

(for kanamycin resistance gene segment) 

(5) 5'-GGT ACA TGA TCA CGT AGA AAA GAT CAA AGG ATC TTC TTG- 
3'[Bcll], SEQ.ID:95:, 

(6) 5'-CCA CAT GTC GAC CC GTA AAA AGG CCG CGT TGC TGG-3' [Sail], 
SEQ.ID:96: 

(for E. coli origin of replication) 

Ligation junctions were sequenced for V1R using the following 

oligomers: 

5'-GAG CCA ATA TAA ATG TAC-3', SEQ.ID:97: [CMVintA/kanr 
junction] 

5'-CAA TAG CAG GCA TGC-3', SEQ.ID:98: [BGH/ori junction] 
5'-G CAA GCA GCA GAT TAC-3', SEQ.ID:99: [ori/kanr junction] 

EXAMPLE 14 
The HIV genes which appear to be the most important for 
PNV development are env and gag. Both env and gag require the HIV 
regulatory protein, rev, for either viral or heterologous expression. 
Because efficient expression of these gene products is essential for PNV 
function, two types of vectors, rev-dependent and rev-independent, 
were tested for vaccination purposes. Unless stated otherwise, all genes 
were derived from the HIV-1 (IIIB) laboratory isolate. 
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A. env : Depending upon how large a gene segment is used, varying 
efficiencies of rev-independent envy expression may be achieved by 
replacing the native leader peptide of env with the leader peptide from 
the tissue-specific plasminogen activator (tPA) gene and expressing the 
resulting chimeric gene behind the CMV promoter with the CMV 
intronA. VlJns-tPA-gpl20 is an example of a secreted gpl20 vector 
constmcted in this fashion which functions to yield anti-gpl20 immune 
responses in vaccinated mice and monkeys. 

Published reports indicate that membrane-anchored 
proteins may induce a more substantial antibody responses compared to 
secreted proteins. Membrane-anchored proteins may also induce 
antibody responses to additional immune epitopes. To test this 
hypothesis, VlJns-tPA-gpl60 and VlJns-rev/env were prepared. The 
tPA-gpl60 vector produced detectable quantities of gpl60 and gpl20, 
without the addition of ev, as shown by immunoblot analysis of RD 
cells transfected in vitro , although expression was much lower than that 
obtained for rev/env, a rev-dependent gpl60-expressing plasmid. This 
may be due to the presence of inhibitory regions, which confer rev 
dependence upon the gpl60 transcript occur at multiple sites within 
gpl60 including at the COOH-terminus of gp41. 

Vectors containing truncated forms of tPA-gpl60, tPA- 
gpl43 and tPA-gpl50, designed to increase the overall expression of 
env by elimination of these inhibitory sequences, were prepared. The 
truncated gpl60 vectors lack intracellular gp41 regions containing 
peptide motifs (such as leu-leu) which are known to cause diversion of 
membrane proteins to the lysosomes rather than the cell surface. Thus, 
gpl43 and gpl50 may be expected to increase the transport of protein to 
the cell surface compared to full-length gpl60 where these proteins may 
be better able to elicit anti-gpl60 antibodies following DNA 
vaccination. 

A quantitative ELISA for gpl60/gpl20 expression in cell 
transfectants was developed to determine the relative expression 
capabilities for these vectors as well as for an additional vector which 
combines the features of tPA-gpl60 and rev/env (vector rev/tPA- 
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gpl60). In vitro transfection of 293 cells followed by quantitation of 
cell-associated vs. secreted/released gpl20 yielded the following results: 

(1) for the analogous plasmid pair, rev/env and rev/tPA- 
gpl60, substitution of the native leader peptide in gpl60 with the tPA 
leader sequence did not increase the total expression of gpl60 or the 
amount of released gpl20. This suggests that the leader peptide is not 
responsible for inefficient trafficking of gpl60 to the cell surface in 
these cells. 

(2) tPA-gpl60 expresses 5-1 OX less gpl60 than rev/env 
with similar proportions retained intracellularly vs. trafficked to the 
cell surface. 

(3) tPA-gpl43 gave 3-6X greater secretion of gpl20 than 
rev/env with only low levels of cell-associated gpl43 confirming that 
the cytoplasmic tail of gpl60 causes intracellular retention of gpl60 
which can be overcome by partial deletion of this sequence. 

(4) tPA-gpl50 gave only low levels of gpl60 in both cells 
and media, indicating either a problem with this construct or inherent 
instability of the truncated protein. 

tPA-gpl20 derived from a primary HIV isolate (containing 
the North American consensus V3 peptide loop; macrophage tropic and 
nonsyncytia-inducing phenotypes) gave high expression/secretion of 
gpl20 with transfected 293 cells demonstrating that it was cloned in a 
functional form. 

EXAMPLE 15 

Serological Assays 

A. Antibody Responses: 

1. gpl20 PNVs 

An ID vs. IM vaccination experiment in mice was 
completed using VI Jns-tPA-gp 120 (100, 10, 1 |Lig: 3X). ID vaccination 
appeared superior at the lower doses following initial rounds but all 
doses were equivalent after three rounds. 

Rhesus monkeys (RHM) and African green monkeys 
(AGM) were vaccinated with the VlJns-tPA-gpl20 (MN) PNV. Peak 
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GMTs for gpl20 antibodies differed by more than five-fold between 
these two primate species: 1780 (AGM) and 310 (RHM). These results 
indicate that substantially larger antibody titers can be elicited in AGM 
compared to RHM and suggest that higher HIV neutralization titers may 
be obtained by AGM vaccination. 

2. gpl60 PNVs : VlJns-rev/env vaccination (IM) of mice 
did not yield antibodies to gpl60 until three injections while ID 
vaccination yielded responses after one round which remained higher 
than those produced by IM throughout the experiment (GMTs = 2115 
(ID) and 95 (IM); 200 |ig/mouse). This suggests that rev-dependent 
constructs can function as immunogens better by the ID route. 

RHM receiving ID or IM inoculations with VlJns-rev/env 
showed peak GMTs = 790 and 140, respectively, following 4-5 
inoculations (2 mg/round). These results agree with those found for 
mice showing that this rev dependent PNV has greater efficacy for 
antibody generation by ID vaccination although the rev -independent 
constmct VI Jns-tPA-gpl20 did not. RHM receiving tPA-gpl60 DNA 
(IM) showed lower, more variable antibody responses than those 
receiving rev/env which corroborate our determination that this vector 
expresses gpl60 4-7X less efficiently than rev/env. 
B. In Vitro Virus Neutralization 

An infectivity reduction neutralization assay (p24 gag 
production readout) using HIV(MN) as a virus source was performed 
by Quality Biologicals, Inc. (QBI). At low virus input (100 TCID50) 
complete neutralization was seen at 1/10 dilutions of sera for all three 
antisera with at least 80-90 % reduction in virus production observed in 
all samples up to 1/80 dilutions as compared to matched prebleed sera. 
However, at higher virus input (1000 TCID50), no neutralization was 
observed for any sample. 

RHM were tested for Hiy (HIB) neutralization (QBI), 
using 100 TCID50 of input virus, following vaccinations with tPA- 
gpl20 (inB) DNA. In two different experiments the best neutralization 
results were obtained at serum dilutions of 10 (40-99% reduction of 'p24 
gag ) with gag reduction observed in some samples at dilutions as high 
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as 80-fold. The most consistent samples in this assay had anti-gpl20 
antibody ELISA endpoint titers of at least 2000-3000. 

RHM were similarly tested for HIV (IIIB) neutralization 
(OBI) following vaccinations with rev/env DNA. Overall, low levels of 
neutralization were observed: two of three RHM showed neutralization 
ranging up to 84% at a serum dilution of 10 with p24 gag reduction 
observed at subsequent dilutions of 20 or 40 while one sample did not 
show any evidence of neutralization. These samples had anti-gpl20 
antibody ELISA titers of 700-800 indicating that this is the minimum 
useful titer range for testing sera derived from gpl60 DNA vaccine 
experiments in neutralization assays. 
C. Facilitators for Enhanced Immunity 

Several experiments were initiated to test plasmid DNA 
formulations which have been reported to enhance DNA uptake 
following vaccination and increase either reporter gene expression or 
immune responses in mouse or monkey vaccinees. Hypertonic sucrose 
(up to 20-25%, w/v) DNA solutions have been reported to give more 
uniform distribution of DNA uptake, as evidenced by reporter gene 
expression, and was used in experiments in which substantial gpl60- 
specific antibodies were elicited in rodents and nonhuman primates 
vaccinated with a rev/gpl60 plasmid. The anesthetic, bupivicaine (0.25- 
0.75%, w/v), has also been reported to significantly enhance DNA 
vaccine-mediated immune responses in mice and nonhuman primates 
when used either as a pretreatment for IM injection, or as by co- 
injection with DNA in isotonic saline solution. 

Our initial results with bupivicaine showed that substantial 
mortality was caused by IM treatment with 0.5% solutions. Mortality 
varied depending on the volume of solution used and whether the mice 
were injected while under anesthetic (> 0.1 mL w/o anesthetic gave 
highest mortahty). Our experiments have used 0.25% solutions 
without significant mortality either as a pre-treatment or a co-treatment 
and using gpl20 or rev/env PNVs. A preliminary experiment using 
bupivicaine as a pre-treatment for three vaccination rounds did not 
show any enhancement of immune responses relative to control mice 
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while a larger experiment using both ID and IM sites as a pre-treatment 
or co-treatment has not shown any increased antibody levels following 
one injection and appeared to decrease antibody responses in some 
groups. Three vaccinations are planned in the current study. 

This sucrose formulation experiment tested a variety of 
conditions described in the literature. Sucrose concentration was tested 
at 10, 15, 20, and 25% in saline or PBS solution containing 0.1 mg/mL 
of tPA-gpl20 plasmid. All samples were tested as a co-injection by IM 
or ID routes except for a 25% sucrose/PBS group that received this 
solution 15-30 minutes prior to IM DNA/PBS injection. Serum data 
derived from bleeds following the first vaccination did not show any 
enhancement of antibody responses. 

EXAMPLE 16 

T Lvmphocvte Responses : 

A. Proliferati on and Cytokine Secretion 

T lymphocytes which have been primed in vivo with 
antigen can proliferate and secrete cytokines during in vitro cell culture 
after exogenous addition of priming antigen. Responding T cells 
usually have a MHC Class D-restricted, CD4-I- (helper) phenotype. 
Helper T cells can be functionally grouped according to the types of 
cytokines they secrete following stimulation by antigen: ThI cells 
secrete primarily IL-2 and g-interferon while Th2 cells are associated 
with IL-4, IL-5, and IL-10 secretion. ThI lymphocytes and cytokines 
promote cellular immunity, including CTL and DTH responses, while 
Th2 cells and cytokines promote B cell activation for humoral 
immunity. We have previously tested for these responses in mice and 
nonhuman primates (AGM and RHM), using rgpl20iiiB for antigen in 
vitro, after vaccination with HIV tPA-gpl20 PNVs and shown that T 
cells from vaccinees of both species exhibit proliferative responses to 
gpl20 in vitro and that these responses are ThI -like and long-lived (> 6 
months) in mice. These studies were continued with a rev PNV. 

1. mouse studies: Mice vaccinated either 3X or IX with 
200 |ig VlJns-rev were tested for in vitro proliferation to recombinant 
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rev (r-rev) protein. Mice vaccinated 3X showed stimulation indices (SI: 
ratio of proliferation of immune cells with and without immunizing 
antigen) of 9-12 while mice receiving IX were the same as background 
(Sis = 2-3). Splenic T cells from all rev vaccinees, but not control 
mice, secreted g-interferon in response to r-rev antigen (2.4-2.8 ng/ml, 
3X; 0.4-0.7 ng/ml, IX) while no IL-4 was detected in culture 
supematants (detection sensitivities = 47 pg/ml and 15 pg/ml for g- 
interferon and IL-4, respectively) showing these T cell responses to be 
THl-like in nature as we found for gpl20 DNA vaccinees. Cytokine 
secretion may be a more sensitive assay than proliferation to specific 
antigen for determining T cell memory responses. Similar results were 
found for mice tested at least six months post vaccination. Antibodies to 
rev were not detected in any vaccinee sera as may be expected for this 
intracellular protein. 

2. Monkey Studies : Three RHM showed strong in vitro T 
cell proliferation ( Sis = 9-30) to r-rev following two vaccinations with 
VlJns-rev. No anti-rev antibodies were detected in any monkeys. 
These results corroborate the above mouse/rev experiments and confirm 
that strong T cell responses can be induced by rev PNVs without 
concomitant induction of antibody responses. 

Further experiments using tPA-gpl20 DNA vaccination of 
RHM showed that (i) in vitro T cell proliferation to rgpl20 was 
obtained following one vaccination; (ii) primary responses were boosted 
following a second vaccination; and, (iii) similar proliferations were 
obtained with these vaccinees as for SHIV-infected RHM (Sis = 5-70 and 
5-35, respectively). 

B. Anti-eA7v Cytotoxic T Lymphocytes 

Two of four RHM monkeys vaccinated with tPA-gpl20 
(IM) and gp 1 60/IRES/rev (ID) PNVs showed significant CTL activities 
(> 20% lysis at 10:1 E/T) against homologous target cells six weeks 
following one vaccination. Two weeks post a second vaccination all 
four monkeys showed cytotoxicities ranging from 20 -35% lysis at 20:1 
E/T. All CTL activities in this assay design were MHC Class I 
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restricted: removal of CD8+ T cells completely removed cytotoxicities 
in all four monkeys. CTL responses waned over several months and 
were boosted to > original levels with subsequent re-vaccination. These 
CTL activities were characterized as the most potent for vaccine- 
mediated responses observed in RHM. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Shiver, John W 
Liu, Margaret A 
Perry, Helen C 

(ii) TITLE OF INVENTION: COORDINATE IN VIVO GENE EXPRESSION 
(iii) NUMBER OF SEQUENCES: 100 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Christine E. Carty 

(B) STREET: 12 6 Lincoln Avenue, P.O. Box 2000 

(C) CITY: Rahway 

(D) STATE: New Jersey 

(E) COUNTRY: United States of America 

(F) ZIP: 07065 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASS IFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Carty, Christine E. 

(B) REGISTRATION NUMBER: 36,090 

(C) REFERENCE /DOCKET NUMBER: 19188Y 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (908) 594-6734 

(B) TELEFAX: (908) 594-4720 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(iv) 



ANTI- SENSE: NO 
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser lie His lie Gly Pro 
15 10 15 

Gly Arg Ala Phe Tyr Thr Thr Gly Glu He He Gly Asp He Arg Gin 
20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENS33: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Cys Thr Arg Pro Asn Tyr Asn Lys Arg Lys Arg He His He Gly Pro 
15 10 15 

Gly Arg Ala Phe Tyr Thr Thr Lys Asn He He Gly Thr He Arg Gin 
20 25 30 

Ala His Cys 
35 

( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) . TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



wo 95/24485 



PCT/US95/02633 



- 88 - 



(iv) ANTI -SENSE: NO 
(V) FRAGMENT TYPE: internal 



SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg lie Arg lie Gin Arg 
5 10 15 

Pro Gly Arg Ala Phe Val Thr lie Gly Lys lie Gly Asn Met Arg 
20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly lie His lie Gly Pro 
15 10 15 

Gly Arg Ala Phe Tyr Thr Thr Gly Lys lie lie Gly Asn lie Arg Gin 
20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Thr Arg Pro Ser Asn Asn Asn Thr Arg Lys Ser lie His lie Gly 
15 10 15 

Pro Gly Lys Ala Phe Tyr Ala Thr Gly Ala lie lie Gly Asp lie Arg 
20 25 30 

Gin Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Ser lie His lie Ala Pro 
15 10 15 

Gly Arg Ala Phe Tyr Ala Thr Gly Asp lie lie Gly Asp lie Arg Gin 
20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



wo 95/24485 



- 90 - 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
CTATATAAGC AGAGCTCGTT TAG 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
GTAGCAAAGA TCTAAGGACG GTGACTGCAG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GTATGTGTCT GAAAATGAGC GTGGAGATTG GGCTCGCAC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
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(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGCGAGCCC AATCTCCACG CTCATTTTCA GACACATAC 3 9 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTACAAGAT CTACTATAGG GAGACCGGAA TTCCGC 3 6 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 



60 
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CAGCTTGTCT 


GTAAGCGGAT 


GCCGGGAGCA 


GACAAGCCCG 


TCAGGGCGCG 


TCAGCGGGTG 


120 


TTGGCGGGTG 


TCGGGGCTGG 


CTTAACTATG 


CGGCATCAGA 


GCAGATTGTA 


CTGAGAGTGC 


180 


ACCATATGCG 


GTGTGAAATA 


CCGCACAGAT 


GCGTAAGGAG 


AAAATACCGC 


ATCAGATTGG 


240 


CTATTGGCCA 


TTGCATACGT 


TGTATCCATA 


TCATAATATG 


TACATTTATA 


TTGGCTCATG 


300 


TCCAACATTA 


C CGC C ATGTT 


GACATTGATT 


ATTGACTAGT 


TATTAATAGT 


AATCAATTAC 


360 


GGGGTCATTA 


GTTCATAGCC 


CATATATGGA 


GTTCCGCGTT 


ACATAACTTA 


CGGTAAATGG 


420 


CCCGCCTGGC 


TGACCGCCCA 


ACGACCCCCG 


CCCATTGACG 


TCAATAATGA 


CGTATGTTCC 


480 


CATAGTAACG 


CCAATAGGGA 


CTTTCCATTG 


ACGTCAATGG 


GTGGAGTATT 


TACGGTAAAC 


540 


TGCCCACTTG 


GCAGTACATC 


AAGTGTATCA 


TATGCCAAGT 


ACGCCCCCTA 


TTGACGTCAA 


600 


TGACGGTAAA 


TGGCCCGCCT 


GGCATTATGC 


CCAGTACATG 


ACCTTATGGG 


ACTTTCCTAC 


660 


TTGGCAGTAC 


ATCTACGTAT 


TAGTCATCGC 


TATTACCATG 


GTGATGCGGT 


TTTGGCAGTA 


720 


CATCAATGGG 


CGTGGATAGC 


GGTTTGACTC 


ACGGGGATTT 


CCAAGTCTCC 


ACCCCATTGA 


780 


CGTCAATGGG 


AGTTTGTTTT 


GGCACCAAAA 


TCAACGGGAC 


TTTCCAAT^T 


GTCGTAACAA 


840 


CTCCGCCCCA 


TTGACGCAAA 


TGGGCGGTAG 


GCGTGTACGG 


TGGGAGGTCT 


ATATAAGCAG 


900 


AGCTCGTTTA 


GTGAACCGTC 


AGATCGCCTG 


GAGACGCCAT 


CCACGCTGTT 


TTGACCTCCA 


960 


TAGAAGACAC 


CGGGACCGAT 


CCAGCCTCCG 


CGGCCGGG AA 


CGGTGCATTG 


GAACGCGGAT 


1020 


TCCCCGTGCC 


AAGAGTGACG 


TAAGTACCGC 


CTATAGAGTC 


TATAGGCCCA 


CCCCCTTGGC 


1080 


TTCTTATGCA 


TGCTATACTG 


TTTTTGGCTT 


GGGGTCTATA 


CACCCCCGCT 


TCCTCATGTT 


1140 


ATAGGTGATG 


GTATAGCTTA 


GCCTATAGGT 


GTGGGTTATT 


GACCATTATT 


GACCACTCCC 


1200 


CTATTGGTGA 


CGATACTTTC 


CATTACTAAT 


CCATAACATG 


GCTCTTTGCC 


ACAACTCTCT 


1260 


TTATTGGCTA 


TATGCCAATA 


CACTGTCCTT 


CAGAGACTGA 


CACGGACTCT 


GTATTTTTAC 


1320 


AGGATGGGGT 


CTCATTTATT 


ATTTACAAAT 


TCACATATAC 


AACACCACCG 


TCCCCAGTGC 


1380 


CCGCAGTTTT 


TATTAAACAT 


AACGTGGGAT 


CTCCACGCGA 


ATCTCGGGTA 


CGTGTTCCGG 


1440 


ACATGGGCTC 


TTCTCCGGTA 


GCGGCGGAGC 


TTCTACATCC 


GAGCCCTGCT 


CCCATGCCTC 


1500 


CAGCGACTCA 


TGGTCGCTCG 


gCagctcctt 


GCTCCTAACA 


GTGGAGGCCA 


GACTTAGGCA 


1560 


CAGCACGATG 


CCCACCACCA 


CCAGTGTGCC 


GCACAAGGCC 


GTGGCGGTAG 


GGTATGTGTC 


1620 


TGAAAATGAG 


CTCGGGGAGC 


GGGCTTGCAC 


CGCTGACGCA 


TTTGGAAGAC 


TTAAGGCAGC 


1680 


GGCAGAAGAA 


GATGCAGGCA 


GCTGAGTTGT 


TGTGTTCTGA 


TAAGAGTCAG 


AGGTAACTCC 


1740 
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CGTTGCGGTG CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 180 0 

GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA TGGGTCTTTT 18 6 0 

CTGCAGTCAC CGTCCTTAGA TCTGCTGTGC CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 1920 

CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG GTGCCACTCC CACTGTCCTT TCCTAATAAA 1980 

ATGAGGAAAT TGCATCGCAT TGTCTGAGTA GGTGTCATTC TATTCTGGGG GGTGGGGTGG 2040 

GGCAGCACAG CAAGGGGGAG GATTGGGAAG ACAATAGCAG GCATGCTGGG GATGCGGTGG 210 0 

GCTCTATGGG TACCCAGGTG CTGAAGAATT GACCCGGTTC CTCCTGGGCC AGAAAGAAGC 2160 

AGGCACATCC CCTTCTCTGT GACACACCCT GTCCACGCCC CTGGTTCTTA GTTCCAGCCC 222 0 

CACTCATAGG ACACTCATAG CTCAGGAGGG CTCCGCCTTC AATCCCACCC GCTAAAGTAC 22 8 0 

TTGGAGCGGT CTCTCCCTCC CTCATCAGCC CACCAAACCA AACCTAGCCT CCAAGAGTGG 23 40 

GAAGAAATTA AAGCAAGATA GGCTATTAAG TGCAGAGGGA GAGAAAATGC CTCCAACATG 2400 

TGAGGAAGTA ATGAGAGAAA TCATAGAATT TCTTCCGCTT CCTCGCTCAC TGACTCGCTG 2460 

CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA 2520 

TCCACAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC 2580 

AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG 2640 

CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT ATAAAGATAC 27 0 0 

CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT GCCGCTTACC 27 6 0 

GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC TTTCTCAATG CTCACGCTGT 2820 

AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 2880 

GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA 2940 

CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC GAGGTATGTA 3 00 0 

GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG GCTACACTAG AAGGACAGTA 3 0 60 

TTTGGTATCT GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA 312 0 

TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG 3180 

CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG 3240 

TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG GATCTTCACC 33 00 

TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT AAAGTATATA TGAGTAAACT 33 60 

TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTGTCTATTT 3420 
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CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA 3 48 0 

CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA 3 540 

TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC AACTTTATCC 3 60 0 

GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT 3 660 

AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG TGTCACGCTC GTCGTTTGGT 3720 

ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG 3780 

TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA 3 840 

GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT GCCATCCGTA 39 00 

AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT TCTGAGAATA GTGTATGCGG 3 960 

CGACCGAGTT GCTCTTGCCC GGCGTCAATA CGGGATAATA CCGCGCCACA TAGCAGAACT 402 0 

TTAAAAGTGC TCATCATTGG AT^AACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG 4080 

CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT 4140 

ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC AAAAAAGGGA 420 0 

ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA TTATTGAAGC 4260 

ATTTATCAGG GTTATTGTCT CATGAGCGGA TACATATTTG AATGTATTTA GAAAAATAAA 432 0 

CAAATAGGGG TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA AGAAACCATT 43 8 0 

ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG TC 4432 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
ATTGGCTATT GGCCATTGCA TACGTTGTAT CCATATCATA ATATGTACAT TTATATTGGC 
TCATGTCCAA CATTACCGCC ATGTTGACAT TGATTATTGA CTAGTTATTA ATAGTAATCA 



60 



120 
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ATTACGGGGT CATTAGTTCA TAGCCCATAT ATGGAGTTCC GCGTTACATA ACTTACGGTA 18 0 

AATGGCCCGC CTGGCTGACC GCCCAACGAC CCCCGCCCAT TGACGTCAAT AATGACGTAT 240 

GTTCCCATAG TAACGCCAAT AGGGACTTTC CATTGACGTC AATGGGTGGA GTATTTACGG 3 00 

TAAACTGCCC ACTTGGCAGT ACATCAAGTG TATCATATGC CAAGTACGCC CCCTATTGAC 3 60 

GTCAATGACG GTAAATGGCC CGCCTGGCAT TATGCCCAGT ACATGACCTT ATGGGACTTT 42 0 

CCTACTTGGC AGTACATCTA CGTATTAGTC ATCGCTATTA CCATGGTGAT GCGGTTTTGG 480 

CAGTACATCA ATGGGCGTGG ATAGCGGTTT GACTCACGGG GATTTCCAAG TCTCCACCCC 540 

ATTGACGTCA ATGGGAGTTT GTTTTGGCAC CAAAATCAAC GGGACTTTCC AAAATGTCGT 60 0 

AACAACTCCG CCCCATTGAC GCAAATGGGC GGTAGGCGTG TACGGTGGGA GGTCTATATA 6 60 

AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC GCCATCCACG CTGTTTTGAC 72 0 

CTCCATAGAA GACACCGGGA CCGATCCAGC CTCCGCGGCC GGGAACGGTG CATTGGAACG 780 

CGGATTCCCC GTGCCAAGAG TGACGTAAGT ACCGCCTATA GAGTCTATAG GCCCACCCCC 840 

TTGGCTTCTT ATGCATGCTA TACTGTTTTT GGCTTGGGGT CTATACACCC CCGCTTCCTC 900 

ATGTTATAGG TGATGGTATA GCTTAGCCTA TAGGTGTGGG TTATTGACCA TTATTGACCA 960 

CTCCCCTATT GGTGACGATA CTTTCCATTA CTAATCCATA ACATGGCTCT TTGCCACAAC 1020 

TCTCTTTATT GGCTATATGC CAATACACTG TCCTTCAGAG ACTGACACGG ACTCTGTATT 1080 

TTTACAGGAT GGGGTCTCAT TTATTATTTA CAAATTCACA TATACAACAC CACCGTCCCC 1140 

AGTGCCCGCA GTTTTTATTA AACATAACGT GGGATCTCCA CGCGAATCTC GGGTACGTGT 1200 

TCCGGACATG GGCTCTTCTC CGGTAGCGGC GGAGCTTCTA CATCCGAGCC CTGCTCCCAT 1260 

GCCTCCAGCG ACTCATGGTC GCTCGGCAGC TCCTTGCTCC TAACAGTGGA GGCCAGACTT 1320 

AGGCACAGCA CGATGCCCAC CACCACCAGT GTGCCGCACA AGGCCGTGGC GGTAGGGTAT 1380 

GTGTCTGAAA ATGAGCTCGG GGAGCGGGCT TGCACCGCTG ACGCATTTGG AAGACTTAAG 1440 

GCAGCGGCAG AAGAAGATGC AGGCAGCTGA GTTGTTGTGT TCTGATAAGA GTCAGAGGTA 150 0 

ACTCCCGTTG CGGTGCTGTT AACGGTGGAG GGCAGTGTAG TCTGAGCAGT ACTCGTTGCT 1560 

GCCGCGCGCG CCACCAGACA TAATAGCTGA CAGACTAACA GACTGTTCCT TTCCATGGGT 1620 

CTTTTCTGCA GTCACCGTCC TTAGATCTGC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT 1680 

TTGCCCCTCC CCCGTGCCTT CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA 1740 

ATAAAATGAG GAAATTGCAT CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG 1800 
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GGTGGGGCAG CACAGCAAGG GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC 1860 

GGTGGGCTCT ATGGGTACCC AGGTGCTGAA GAATTGACCC GGTTCCTCCT GGGCCAGAAA 192 0 

GAAGCAGGCA CATCCCCTTC TCTGTGACAC ACCCTGTCCA CGCCCCTGGT TCTTAGTTCC 19 80 

AGCCCCACTC ATAGGACACT CATAGCTCAG GAGGGCTCCG CCTTCAATCC CACCCGCTAA 2040 

AGTACTTGGA GCGGTCTCTC CCTCCCTCAT CAGCCCACCA AACCAAACCT AGCCTCCAAG 210 0 

AGTGGGAAGA AATTAAAGCA AGATAGGCTA TTAAGTGCAG AGGGAGAGAA AATGCCTCCA 2160 

ACATGTGAGG AAGTAATGAG AGAAATCATA GAATTC 219 6 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 60 

CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 12 0 

TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA CTGAGAGTGC 180 

ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG 240 

CTATTGGCCA TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 3 00 

TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT AATCAATTAC 3 60 

GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT ACATAACTTA CGGTAAATGG 42 0 

CCCGCCTGGC TGACCGCCCA ACGACCCCCG CCCATTGACG TCAATAATGA CGTATGTTCC 480 

CATAGTAACG CCAATAGGGA CTTTCCATTG ACGTCAATGG GTGGAGTATT TACGGTAAAC 540 

TGCCCACTTG GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 60 0 

TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG ACTTTCCTAC 660 

TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG GTGATGCGGT TTTGGCAGTA 720 



wo 95/24485 



PCTAJS95/02633 



- 97 - 



C ATP A ATGGG 


C GTGG AT AGC 


GGTTTGACTC 


ACGGGGATTT 


CCAAGTCTCC 


ACCCCATTGA 


780 


PGTP A ATGGG 


AGTTTGTTTT 


GGCACCAAAA 


TCAACGGGAC 


TTTCCAAAAT 


GTCGTAACAA 


840 


PTPGGPCCC A 

\^ X V-- ^ VJ V— V-- 


TTGACGCAAA 


TGGGCGGTAG 


GCGTGTACGG 


TGGGAGGTCT 


ATATAAGCAG 


900 


AGPTPGTTTA 


GTGAACCGTC 


AGATCGCCTG 


GAGACGCCAT 


CCACGCTGTT 


TTGACCTCCA 


960 


TAG A AG AC AC 


CGGGACCGAT 


CCAGCCTCCG 


CGGCCGGGAA 


CGGTGCATTG 


GAACGCGGAT 


1020 


TCCPCGTGCC 


AAGAGTGACG 


TAAGTACCGC 


CTATAGAGTC 


TATAGGCCCA 


CCCCCTTGGC 


1080 


TTP TT A TGC A 


TGCTATACTG 


TTTTTGGCTT 


GGGGTCTATA 


CACCCCCGCT 


TCCTCATGTT 


1140 


ATAGGTGATG 


GT AT AGC TT A 


GCCTATAGGT 


GTGGGTTATT 


GACCATTATT 


GACCACTCCC 


1200 




PG ATAPTTTP 


CATTACTAAT 


CCATAACATG 


GCTCTTTGCC 


ACAACTCTCT 


1260 




TATPPP A ATA 


CACTGTCCTT 


C AG AG AC TG A 


CACGGACTCT 


GTATTTTTAC 


1320 




f^Tf^ A TTT A TT 
v_ X ^-.AX X Xi-lX X 


ATTTACAAAT 


TCACATATAC 


AACACCACCG 


TCCCCAGTGC 


1380 


r^r^r^c Ar*TTTT 

V— X X X X 


TATTAAAPAT 

XAX X xxru^v^x-l X 


AACGTGGGAT 


CTCCACGCGA 


ATCTCGGGTA 


CGTGTTCCGG 


1440 


A r* A TfinGP TP 


TTP TP PGGT A 


GCGGCGGAGC 


TTCTACATCC 


GAGCCCTGCT 


CCCATGCCTC 


1500 


o Ar^r^r*" a r'Tr* A 


TGGTPGPTPG 


GCAGCTCCTT 


GCTCCTAACA 


GTGGAGGCCA 


GACTTAGGCA 


1560 


A'— 1 ^ 


PPPAPPAPCA 


CCAGTGTGCC 


GCACT^GGCC 


GTGGCGGTAG 


GGTATGTGTC 


1620 


TG AAA ATG AG 


CTCGGGGAGC 


GGGCTTGCAC 


CGCTGACGCA 


TTTGGAAGAC 


TTAAGGCAGC 


1680 


GGP AG A AG AA 


GATGCAGGCA 


GCTGAGTTGT 


TGTGTTC TG A 


TAAGAGTCAG 


AGGTAACTCC 


1740 


PGTTGPGGTG 


C TGTTAACGG 


TGGAGGGCAG 


TGTAGTCTGA 


GCAGTACTCG 


TTGCTGCCGC 


1800 




AGACATAATA 


GCTGACAGAC 


TAACAGACTG 


TTCCTTTCCA 


TGGGTCTTTT 


1860 


PTGPAGTCAC 


CGTCCTTAGA 


TCTGCTGTGC 


CTTCTAGTTG 


CCAGCCATCT 


GTTGTTTGCC 


1920 


\^\^ X ^ V— V-. *w X 


GCCTTCCTTG 


ACCCTGGAAG 


GTGCCACTCC 


CACTGTCCTT 


TCCTAATAAA 


1980 


ATGAGGAAAT 


TGCATCGCAT 


TGTCTGAGTA 


GGTGTCATTC 


TATTCTGGGG 


GGTGGGGTGG 


2040 


GGCAGCACAG 


CAAGGGGGAG 


GATTGGGAAG 


ACAATAGCAG 


GCATGCTGGG 


GATGCGGTGG 


2100 


GCTCTATGGG 


TACCCAGGTG 


CTGAAGAATT 


GACCCGGTTC 


CTCCTGGGCC 


AGAAAGAAGC 


2160 


AGGCACATCC 


CCTTCTCTGT 


GACACACCCT 


GTCCACGCCC 


CTGGTTCTTA 


GTTCCAGCCC 


2220 


CACTCATAGG 


ACACTCATAG 


CTCAGGAGGG 


CTCCGCCTTC 


AATCCCACCC 


GCTAAAGTAC 


2280 


TTGGAGCGGT 


CTCTCCCTCC 


CTCATCAGCC 


CACCAAACCA 


AACCTAGCCT 


CCAAGAGTGG 


2340 


GAAGAAATTA 


AAGCAAGATA 


GGCTATTAAG 


TGCAGAGGGA 


GAGAAAATGC 


CTCCAACATG 


2400 
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TGAGGAAGTA 


ATGAGAGAAA 


TCATAGAATT 


TCTTCCGCTT 


CCTCGCTCAC 


TGACTCGPTG 




CGCTCGGTCG 


TTCGGCTGCG 


GCGAGCGGTA 


TCAGCTCACT 


CAAAGGCGGT 


A A T A PGP TT* 21 




TCCACAGAAT 


CAGGGGATAA 


CGCAGGAAAG 


AACATGTGAG 


PA A A AGGPP A 


CC AAA T^CCCC 


O c: Q n 

Z Do U 


AGGAACCGTA 


AAAAGGC CGC 


GTTGCTGGCG 


TTTTTC CAT A 

X X X X XVvV^^^X^^ 




CCC'VC'h.CC'h.C 


^ b4 U 


CATCACAAAA 


ATCGACGCTC 


AAGTCAGAGG 


TGGCGAAAPP 


PGAP AGGAPT 


A 1 AAAvjf A i AL, 


O "7 A ri 


CAGGCGTTTC 


CCCCTGGAAG 


CTCCCTCGTG 


CGCTCTCCTG 


TTPPGAPPPT 






GGATACCTGT 


C CGC C TTTC T 


CCCTTCGGGA 


AGCGTGGCGC 


TTTCTCAATG 

XXX X X 


PTP A PGPT'P'P 




AGGTATCTCA 


GTTCGGTGTA 


GGTCGTTCGC 


TCCAAGCTGG 


GCTGTGTGP A 

WW X vJ X vj» X 


PGA 'h.CCCCCC 


O Q Q n 

z o o u 


GTTCAGCCCG 


ACCGCTGCGC 


CTTATCCGGT 


AACTATCGTC 


TTG AGTPn A A 


CCCCCTiTi Ar^A 


z y 4 U 


CACGACTTAT 


CGCCACTGGC 


AGCAGCCACT 


GGTAACAGGA 


TTAGP AGAGP 




■3 n n n 

J U U U 


GGCGGTGCTA 


CAGAGTTCTT 


GAAGTGGTGG 


CCTAACTACG 


GCTACAPTAG 


A AGG AG AGTA 


J U D u 


TTTGGTATCT 


GCGCTCTGCT 


G AAG CC AGTT 


ACCTTCGG A A 


A A AG APT'TV^r^ 


1 AtsL. i i i\aA 


"0 1 n 

S IzU 


TCCGGCAAAC 


AAAC C ACCGC 


TGGTAGCGGT 


WW X X X X X X X vJ 


TT'T'GG A A A 


r^C A O A rrifTiTv f~^/~^ 
taUAtjrAi lACG 


3180 


CGCAGAAAAA 


AAGGATCTCA 


AGAAGATrT'T 


X X \Sf\ X <v X X X X 


^ 1 /^L.L7(jrVjL7 1 \^ 


TGACGCTCAG 


3240 


TGGAACGAAA 


X ^_o^^V_<VJ J. X fit 


A ^^r^<^ A ' 1 " 1 " 1 " I 
nVjV,7V.7 A ± X JL J. ^ 




T'AT'^A A A 7\ 7i 

i A i C AAAAAG 


GATCTTCACC 


33 00 


T AG ATC C TTT 


TAAATTAAAA 


n X V3 X X X X 


A A ATP A A TT^'P 


X 7i A ^^T* A nn A Tt A 


TGAGTAAACT 


33 60 


TGGTCTGACA 


GTTACCAATG 


w X X ^"l^"! X \^J^\J X 


GAGGP ArT'TA 


1 1 L-AvjtCCjA X 


CTGTCTATTT 


3420 


CGTTCATCCA 


TAG TTG C C TG 


ACTCCGGGGt^ 

irtv^ X w wVjVj^ 


dczczcLcznc ccc 

VjvjVjVjVjVj»Vj Vj» V- 




CTCGTGAAGA 


3480 


AGGTGTTGCT 


GACTCATACC 


AGGCC TGAAT 


CGCCCC ATC A 


TPP AGGG Ar^ A 


AAvj i ACjGGr A 


3 d40 


GCCACGGTTG 


ATGAGAGCTT 


TGTTGTAGGT 


GGACCAGTTG 


GTGATTT"Pr» A 
V7 X \Jt\ X X X X 


At- X X 1 1 t»C X i 


0 ^ A A 

3 dO 0 


TGCCACGGAA 


CGGTCTGCGT 


TGTCGGGAAG 


ATGCGTGATC 


TG ATC C TTP A 

X VJJT* X W W> X X ^ t\ 


APTP AGG A A A 


•5 /r C A 
J b b U 


AGTTCGATTT 


ATTCAACAAA 


GCCGCCGTCC 


CGTCAAGTCA 


GCGTAATGCT 


CTGCCAGTGT 


3720 


TACAACCAAT 


TAACCAATTC 


TGATTAGAAA 


AACTCATCGA 


GCATCAAATG 


AAACTGCAAT 


3780 


TTATTCATAT 


CAGGATTATC 


AATACCATAT 


TTTTGAAAAA 


GCCGTTTCTG 


TAATGAAGGA 


3840 


GAAAACTCAC 


CGAGGCAGTT 


CCATAGGATG 


GCAAGATCCT 


GGTATCGGTC 


TGCGATTCCG 


3900 


ACTCGTCCAA 


CATCAATACA 


ACCTATTAAT 


TTCCCCTCGT 


CAAAAATAAG 


GTTATCAAGT 


3960 


GAGAAATCAC 


CATGAGTGAC 


GACTGAATCC 


GGTGAGAATG 


GCAAAAGCTT 


ATGCATTTCT 


4020 


TTCCAGACTT 


GTTCAACAGG 


CCAGCCATTA 


CGCTCGTCAT 


CA/^AATCACT 


CGCATCAACC 


4080 
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AAACCGTTAT 


TCATTCGTGA 


TTGCGCCTGA 


GCGAGACGAA 


ATACGCGATr 


nPT'GTT' A AAA 


4t X 4t U 


GGACAATTAC 


AAACAGGAAT 


CGAATGCAAC 


CGGCGCAGGA 


ACACTGCCAG 


CGC ATP A APZ\ 


4t z u u 


ATATTTTCAC 


CTGAATCAGG 


ATATTCTTC T 


AATACCTGGA 


ATGC TGTTTT 




/t o c n 


GCAGTGGTGA 


GTAACCATGC 


ATCATCAGGA 


GTACGGATAA 


AATGCTTGAT 




/I "5 O A 
4 J 4(5 U 


GGCATAAATT 


CCGTCAGCCA 


GTTTAGTCTG 


ACCATCTCAT 


CTGT A AP ZiTr" 

-t. VJ J. jrlrtV^rTk X \^ 


fix XijrL?V^A/\L.tj 


>i o o n 
4 J o U 


CTACCTTTGC 


CATGTTTCAG 


AAAC AAC TC T 




X X^— V_\^Ax>i 


v- AA 1 L.CjA i AG 


4440 


ATTGTCGCAC 


CTGATTGCCC 


GACATTATCG 


CGAGCCCATT 


X A X X 


T^A A AT'/*' AO/^ A 
1 AAA X C, ACjC A 


4500 


IL-CAIGTTGG 


AATTTAATCG 


CGGCCTCGAG 


CAAGACGTTT 


CCCGTTGAAT 


ATGGCTCATA 


4560 


ACACCCCTTG 


TATTACTGTT 


TATGTAAGCA 


GACAGTTTTA 


TTGTTCATGA 


TGATATATTT 


4620 


TTATCTTGTG 


CAATGTAACA 


TCAGAGATTT 


TGAGACACAA 


CGTGGCTTTC 


CCCCCCCCCC 


4680 


CATTATTGAA 


GCATTTATCA 


GGGTTATTGT 


CTCATGAGCG 


GATACATATT 


TGAATGTATT 


4740 


TAGAAAAATA 


AACAAATAGG 


GGTTCCGCGC 


ACATTTCCCC 


GAAAAGTGCC 


ACCTGACGTC 


4800 


TAAGAAACCA 


TTATTATCAT 


GACATTAACC 


TATAAAAATA 


GGCGTATCAC 


GAGGCCCTTT 


4860 


CGTC 












4864 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCACATAGAT CTGTTCCATG GTTGTGGCAA TATTATCATC G 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GGTACAAGAT CTACCATGGC AGGAAGAAGC GGAGACAGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCACATAGAT CTGATATCGC ACTATTCTTT AGCTCCTGAC TCC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GATCACCATG GATGCAATGA AGAGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGA 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
GATCTCGCTG GGCGAAACGA AGACTGCTCC ACACAGCAGC AGCACACAGC AGAGCCCTCT 60 
CTTCATTGCA TCCATGGT 7 8 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGTACATGAT CAGATATCGC CCGGGCCGAG ATCTTCAGAC TTGGAGGAGG AG 52 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCACATTGAT CAGCTTGTGT AATTGTTAAT TTCTCTGTCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CCCCGGATCC TGATCACAGA AAAATTGTGG GTCACAGTC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CCCCAGGAAT CCACCTGTTA GCGCTTTTCT CTCTGCACCA CTCTTCTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nuclfeic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGTACATGAT CACAGAAAAA TTGTGGGTCA CAGTC 3 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
CCACATTGAT CAGATATCTT ATCTTTTTTC TCTCTGCACC ACTCTTC 47 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GTCACCGTCC TCTATCAAAG CAGTAAGTAG TACATGCA 3 8 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TGTACTACTT ACTGCTTTGA TAGAGGACGG TGACTGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGTACATGAT CAACCATGAG AGTGAAGGAG AAATATCAGC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CCACATTGAT CAGATATCCC CATCTTATAG CAAAATCCTT TCC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
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(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
CCACATTGAT CAGATATCCC CATCTTATAG CAAAATCCTT TCC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
GGTACACTGC AGTCACCGTC CTATGGCAGG AAGAAGCGGA GAC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO • 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCACATCAGG TACCCCATAA TAGACTGTGA CC 



32 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGTACAAGAT CTACCATGGG ACCAGTACAA CAAATAGGTG GTAAC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCACATAGAT CTTTACATTA ATCTAGCCTT CTGTCCC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGTACAACCA TGGGTGGAGC TATTTCCATG AGG 
(2) INFORMATION FOR SEQ ID NO : 3 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCTAGGTTAG CCTTCTTCTA ACCTCTTCC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GGTACAAGAT CTACCATGGG ATGTCTTGGG AATCAGC 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCACATAGAT CTGATATCGT ATGAGTCTAC TGGAAATAAG AGG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GGTACAGGAT CCACCATGGG TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CCACATGGAT CCGCCCGGGC TTACATCTCT GTACAAATTT CTACTAATGC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGTACAGGAT CCCCGCACGG CAAGAGGCGA GGG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GGTACAGGAT CCACCATGGC TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43. : 

CCACATGGAT CCGCCCGGGC CTTTATTGTG ACGAGGGGTC GTTGC 45 

(2) INFORMATION FOR SEQ ID NO: 44: . 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GGTACAGGAT CCCCGCACGG CAAGAGGCGA GGG 33 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) TUSTTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGTACAGGAT CCACCATGGC TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GTACCTCATG AGCCACATAA TACCATG 



27 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGTACAAGAT CTACCATGGC TTGCAATTGT CAGTTGATGC 4 0 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CCACATAGAT CTCCATGGGA ACTAAAGGAA GACGGTCTGT TC 42 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGTACAAGAT CTACCATGAA GGCAAACCTA CTGGTCCTG 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCACATAGAT CTGATATCCT AATCTCAGAT GCATATTCTG CACTGC 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Arg lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys Asn 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

GAAAGAGCAG AAGACAGTGG CAATGA 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
GGGCTTTGCT AAATGGGTGG CAAGTGGCCC GGGCATGTGG 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
CTAACAGACT GTTCCTTTCC ATG 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GGAGTGGCAC CTTCCAGG 18 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI "SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GGAGACAGCG ACGAAGACCT CCTCAAGGCA GTCAGACTCA TCAAG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 57 : 
GATGGCTGGC AACTAGAAGG CACAGCAGAT CTGATATCGC ACTATTCTTT AGCTCCTGAC 
TCCAATATTG T 



60 
71 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CTTAGATCAA CCATGAGAGT GAAGGAGAAA TATCAGCACT TGTGGAGATG GGGGTGGAGA 60 
TGGGGCACCA TGCTCCTTGG GATGTTGATG ATCTGTAGTG CTACAGAAAA ATTGTGGGT 119 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
CTGGCAACTA GAAGGCACAG CAGATCAGAT AGTGTCCCCA TCTTATAGCA AAATCCTTTC 60 
CAAGCCCTGT CTTATTCT 7 8 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAITOEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GATTTAGGTG ACACTATAG 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TAATACGACT CACTATAGGG 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CATGCCTGCA GGTCGACTCT AAATTCCG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
ACCCGGGGAT CCTCTAGCGC GCTTGTCTCT TGTTCCA 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGGACGTGGT TTTCC 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
TATGGCCACA ACCATGGCAG GAAGAAGCGG AGACAGCGAC GAAGACCTCC TCAAGGCAGT 
CAGACT 

(2) INFORMATION FOR SEQ ID NO: 66: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 0 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CTCGAGCCAT GGGCCCCTAG ACTATAGCGT GATAAGAAAT CGAGGACTGA GGTTATAACA 
TCCTCTAAGG TGGTTATAAA CTCCCGAAGG 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 67 : 

GCTTCGGCCA GTAACG 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



16 



(iv) ANTI -SENSE: NO 



wo 95/24485 



PCT/US95/02633 



119 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TTGCATGCCT GCAGGTGGTA CATGATCAGA TATCGCCCGG GCCGAGATCT TCAGACTTGG 60 
AGGAGGAGAT ATGAGGGACA ATTGGAG 87 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GGGGCGGAAT TTAGAGTCAA TTGATCAGCT TGTGTAATTG TTAATTTCTC TGTCCCACTC 60 
CATCCAGGTC GTGTGATTC 79 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCATCTCCAC AAGTGCTG 18 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 
AGATCTAAGG ACGGTGACTG CATGTACTAC TTACTGCTTT GATAGAGGAC GGTGACTGCA 6 0 

GAAAAGACCC ATGGAAA 77 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:72: 
GCTTCGGCCA GTAACG 16 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GGCACAGCAG ATCAGATGGG GATCTGATAT CGCACTATTC TTTAGCTCCT GACTCCTGAC 
TC 



60 
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(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
GGAATTTGAG TCATCCCCAT CTTATAGCAA AATCCTTTCC AA 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CTTAGATCCC CGCACGGCAA GAGGCGAGGG GCGGCGACTG GT 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
GGCACAGCAG ATCCGCCCGG GCTTACATCT CTGTACAAAT TTCTACTAAT GCTTTTATTT 60 
TTCTTCTGTC 7 q 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CTTAGATCCA CCATGGGTGC GAGAGCGTCA GTATTAAGCG GGGGGAGAAT TAGATCGATG 60 
GGAAAAAATT 70 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 
GGCACAGCAG ATCCGCCCGG GCTTACATCT CTGTACAAAT TTCTACTAAT GCTTTTATTT 
TTCTTCTGTC 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 
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{ C ) STRANDEDNESS : both 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



TCACCGTCCT TAGATCACCA TGGATGCAAT GAAGAGAGGG CTCTGCTGTG TGCTGCTGCT 60 
GTGTGGAGCA GTCTTCGTTT CGCCCAGCGA GATCTGCTGT GCCTTCTAGT TGCCAGCC 118 



(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TTCGTTTCGC CCAGCGATCA CAGAAAAATT GTGGGTCACA GTC 43 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GGCACAGCAG ATCCACGTGT TAGCGCTTTT CTCTCTCCAC CAC 



43 
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(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
TCACCGTCCT TAGATCTACC ATGGGACCAG TACAACT^T AGGTGGTAAC TATGTCCACC 60 
TGCCATTAAG CCCGAGAACA 30 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGCACAGCAG ATCTTTACAT TAATCTAGCC TTCTGTCCCG GTCC 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
TCACCGTCCT TAGATCGGTA CAACCATGGG TGGAGCTATT TCCATGAGGC AATCCAAGCC 60 
GGCTGGAGAT CTGACAGAAA 80 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GGCACAGCAG ATCACCTAGG TTAGCCTTCT TCTAACCTCT TCCTCTGACA GGCCTGACTT 60 
GCTTCCAACT CTTCTGGGTA TCTAG 85 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
ACCGTCCTTA GATTCGACAT AGCAGAATAG GCGTTACTCG ACAGAGGAGA GCAAGAAATG 60 
GAGCCAGTAG ATCCTAGACT AGAGCCCTGG 9 0 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

( D ) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGCACAGCAG ATCCGAGATG CTGCTCCCAC CCCATCTGCT G 
(2) INFORMATION FOR SEQ ID NO: 88: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(i'v) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

His Glu Asp lie lie Ser Leu Trp Asp Gin Ser Leu Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Asp Arg Val lie Glu Val Val Gin Gly Xaa Tyr Arg Ala lie Arg 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
GGTACAAATA TTGGCTATTG GCCATTGCAT ACG 
(2) INFORMATION FOR SEQ ID NO: 92: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
CCACATCTCG AGGAACCGGG TCAATTCTTC AGCACC 3 6 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GGTACAGATA TCGGAAAGCC ACGTTGTGTC TCAAAATC 3 8 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CCACATGGAT CCGTAATGCT CTGCCAGTGT TACAACC 
(2) INFORMATION FOR SEQ ID NO: 95: 



37 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
GGTACATGAT CACGTAGAAA AGATCAAAGG ATCTTCTTG 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
CCACATGTCG ACCCGTAAAA AGGCCGCGTT GCTGG 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 
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GAGCCAATAT AAATGTAC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CAATAGCAGG CATGC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 
GCAAGCAGCA GATTAC 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GATATTGGCT ATTGGCCATT GCATACGTTG TATCCATATC ATAATATGTA CATTTATATT 60 

GGCTCATGTC CAACATTACC GCCATGTTGA CATTGATTAT TGACTAGTTA TTAATAGTAA 12 0 

TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC ATAACTTACG 180 

GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC AATAATGACG 24 0 

TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT GGAGTATTTA 300 

CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT 3 60 

GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC CTTATGGGAC 42 0 

TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT GATGCGGTTT 480 

TGGCAGTACA TCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC AAGTCTCCAC 540 

CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT TCCAAAATGT 60 0 

CGTAACAACT CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT 660 

ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC ACGCTGTTTT 720 

GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG GCCGGGAACG GTGCATTGGA 780 

ACGCGGATTC CCCGTGCCAA GAGTGACGTA AGTACCGCCT ATAGAGTCTA TAGGCCCACC 840 

CCCTTGGCTT CTTATGCATG CTATACTGTT TTTGGCTTGG GGTCTATACA CCCCCGCTTC 900 

CTCATGTTAT AGGTGATGGT ATAGCTTAGC CTATAGGTGT GGGTTATTGA CCATTATTGA 9 60 

CCACTCCCCT ATTGGTGACG ATACTTTCCA TTACTAATCC ATAACATGGC TCTTTGCCAC 1020 

AACTCTCTTT ATTGGCTATA TGCCAATACA CTGTCCTTCA GAGACTGACA CGGACTCTGT 1080 

ATTTTTACAG GATGGGGTCT CATTTATTAT TTACAAATTC ACATATACAA CACCACCGTC 1140 

CCCAGTGCCC GCAGTTTTTA TTAAACATAA CGTGGGATCT CCACGCGAAT CTCGGGTACG 1200 

TGTTCCGGAC ATGGGCTCTT CTCCGGTAGC GGCGGAGCTT CTACATCCGA GCCCTGCTCC 1260 

CATGCCTCCA GCGACTCATG GTCGCTCGGC AGCTCCTTGC TCCTAACAGT GGAGGCCAGA 1320 

CTTAGGCACA GCACGATGCC CACCACCACC AGTGTGCCGC ACAAGGCCGT GGCGGTAGGG 13 80 

TATGTGTCTG AAAATGAGCT CGGGGAGCGG GCTTGCACCG CTGACGCATT TGGAAGACTT 1440 

AAGGCAGCGG CAGAAGAAGA TGCAGGCAGC TGAGTTGTTG TGTTCTGATA AGAGTCAGAG 15 0 0 

GTAACTCCCG TTGCGGTGCT GTTAACGGTG GAGGGCAGTG TAGTCTGAGC AGTACTCGTT 1560 
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GCTGCCGCGC 


GCGCCACCAG 


ACATAATAGC 


TGACAGACTA 


ACAGACTGTT 


CCTTTCCATG 


1620 


GGTCTTTTCT 


GCAGTCACCG 


TCCTTAGATC 


TGCTGTGCCT 


TCTAGTTGCC 


AGCCATCTGT 


1680 


TGTTTGCCCC 


TCCCCCGTGC 


CTTCCTTGAC 


CCTGGAAGGT 


GCCACTCCCA 


CTGTCCTTTC 


1740 


CTAATAAAAT 


GAGGAAATTG 


CATCGCATTG 


TCTGAGTAGG 


TGTCATTCTA 


TTCTGGGGGG 


1800 


TGGGGTGGGG 


CAGCACAGCA 


AGGGGGAGGA 


TTGGGAAGAC 


AATAGCAGGC 


ATGCTGGGGA 


1860 


TGCGGTGGGC 


TCTATGGGTA 


CGGCCGCAGC 


GGCCGTACCC 


AGGTGCTGAA 


GAATTGACCC 


1920 


GGTTCCTCGA 


CCCGTAAAAA 


GGCCGCGTTG 


CTGGCGTTTT 


TCCATAGGCT 


CCGCCCCCCT 


1980 


GACGAGCATC 


ACAAAAATCG 


ACGCTCAAGT 


CAGAGGTGGC 


GAAACCCGAC 


AGGACTATAA 


2040 


AGATACCAGG 


CGTTTCCCCC 


TGGAAGCTCC 


CTCGTGCGCT 


CTCCTGTTCC 


GACCCTGCCG 


2100 


CTTACCGGAT 


ACCTGTCCGC 


CTTTCTCCCT 


TCGGGAAGCG 


TGGCGCTTTC 


TCAATGCTCA 


2160 


CGCTGTAGGT 


ATCTCAGTTC 


GGTGTAGGTC 


GTTCGCTCCA 


AGCTGGGCTG 


TGTGCACGAA 


2220 


CCCCCCGTTC 


AGCCCGACCG 


CTGCGCCTTA 


TCCGGTAACT 


ATCGTCTTGA 


GTCCAACCCG 


2280 


GTAAGACACG 


ACTTATCGCC 


ACTGGCAGCA 


GCCACTGGTA 


ACAGGATTAG 


CAGAGCGAGG 


2340 


TATGTAGGCG 


GTGCTACAGA 


GTTCTTGAAG 


TGGTGGCCTA 


ACTACGGCTA 


CACTAGAAGG 


2400 


ACAGTATTTG 


GTATCTGCGC 


TCTGCTGAAG 


CCAGTTACCT 


TCGGAAAAAG 


AGTTGGTAGC 


2460 


TCTTGATCCG 


GCAAACAAAC 


CACCGCTGGT 


AGCGGTGGTT 


TTTTTGTTTG 


CAAGCAGCAG 


2520 


ATTACGCGCA 


GAAAAAAAGG 


ATCTCAAGAA 


GATCCTTTGA 


TCTTTTCTAC 


GTGATCCCGT 


2580 


AATGCTCTGC 


CAGTGTTACA 


ACCAATTAAC 


CAATTCTGAT 


TAGAAAAACT 


CATCGAGCAT 


2640 


CAAATGAAAC 


TGCAATTTAT 


TCATATCAGG 


ATTATCAATA 


CCATATTTTT 


GAAAAAGCCG 


2700 


TTTCTGTAAT 


GAAGGAGAAA 


ACTCACCGAG 


GCAGTTCCAT 


AGGATGGCAA 


GATCCTGGTA 


2760 


TCGGTCTGCG 


ATTCCGACTC 


GTCCAACATC 


AATACAACCT 


ATTAATTTCC 


CCTCGTCAAA 


2820 


AATAAGGTTA 


TCAAGTGAGA 


AATCACCATG 


AGTGACGACT 


GAATCCGGTG 


AGAATGGCAA 


2880 


AAGCTTATGC 


ATTTCTTTCC 


AGACTTGTTC 


AACAGGCCAG 


CCATTACGCT 


CGTCATCAAA 


2940 


ATCACTCGCA 


TCAACCAAAC 


CGTTATTCAT 


TCGTGATTGC 


GCCTGAGCGA 


GACGAAATAC 


3000 


GCGATCGCTG 


TTAAAAGGAC 


AATTACAAAC 


AGGAATCGAA 


TGCAACCGGC 


GCAGGAACAC 


3060 


TGCCAGCGCA 


TCAACAATAT 


TTTCACCTGA 


ATCAGGATAT 


TCTTCTAATA 


CCTGGAATGC 


3120 


TGTTTTCCCG 


GGGATCGCAG 


TGGTGAGTAA 


CCATGCATCA 


TCAGGAGTAC 


GGATAAAATG 


3180 


CTTGATGGTC 


GGAAGAGGCA 


TAAATTCCGT 


CAGCCAGTTT 


AGTCTGACCA 


TCTCATCTGT 


3240 
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AACATCATTG GCAACGCTAC CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT 33 00 

CCCATACAAT CGATAGATTG TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA 33 60 

CCCATATAAA TCAGCATCCA TGTTGGAATT TAATCGCGGC CTCGAGCAAG ACGTTTCCCG 3 42 0 

TTGAATATGG CTCATAACAC CCCTTGTATT ACTGTTTATG TAAGCAGACA GTTTTATTGT 3480 

TCATGATGAT ATATTTTTAT CTTGTGCAAT GTAACATCAG AGATTTTGAG ACACAACGTG 3 54 0 

GCTTTCC 3 547 
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WHAT IS CLAIMED IS: 

1 . A polynucleotide which, upon introduction into a 
mammalian cell induces the co-expression in the cell of at least two gene 
products, comprising: 

a first transcriptional promoter which operates in 
eukaryotic cells upstream from, and in transcriptional control of, a first 
cistron; 

a second cistron downstream from the first cistron, 
under transcriptional control either of the first transcriptional promoter or 
under control of a second transcriptional promoter; 

optionally, a third cistron downstream from the second 
cistron, under transcriptional control either of the first transcriptional 
^ promoter or under control of the second transcriptional promoter, or under 
control of a third transcriptional promoter; and 

a transcriptional terminator following each of the first, 
second and third cistron. 

2. The polynucleotide of Claim 1 wherein the first cistron 
encodes at least one immunogenic epitope of a pathogen or a cancer 
associated antigen. 

3. The polynucleotide of Claim 2 wherein the pathogen is a 

virus. 

25 

4. The polynucleotide of Claim 3 wherein the virus is the 
human immunodeficiency virus (HIV). 

5. The polynucleotide of Claim 2 wherein the first cistron 

30 

encodes a human immunodeficiency virus (HIV) gene selected from env, 
gag, gag/pol, gag/protease, gag and portions of pol not encoding a 
functional polymerase, and pol. 
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6. The polynucleotide of Claim 1 wherein the second 
cistron encodes a human immunodeficiency vims (HIV) REV gene if the 
first cistron encodes an HIV gene, the efficient expression of which is 
dependent on availability within the cell expressing the HIV gene of the 
REV gene product, 

7. The polynucleotide of Claim 6 wherein the first cistron 
encodes an HIV late gene selected from env, gag and pol. 

8. The polynucleotide of Claim 7 wherein the first cistron 
encodes HIV gpl60, HIV gpl20, HIV gp41, HIV gpl20 lacking a CD4 
binding site and HIV env with an immunologically altered V3, the altered 
V3 having an altered glycosylation pattem or substituted V3 loop tips. 

9. The polynucleotide of Claim 6 wherein the third cistron 
encodes a cytokine or a T-cell costimulatory element. 

10. The polynucleotide of Claim 9 wherein the cytokine is 
interferon, GM-CSF, or interleukin. 

0 

1 1 . The polynucleotide of Claim 9 wherein the T-cell 
costimulatory element is a gene encoding a B7 protein. 

^ 12. The polynucleotide of Claim 1 wherein the first cistron 

encodes a REV -independent human immunodeficiency (HIV) epitope, the 
second cistron encodes a cytokine, and the third cistron encodes a T-cell 
costimulatory element, wherein each of the cistrons may also be presented 
in a different order. 

° 13. The polynucleotide of Claim 12 wherein the second 

cistron encodes an interleukin, an interferon, or GM-CSF, and the third 
cistron encodes a B7 protein. 
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14. The polynucleotide of Claim 1 wherein either of the 
second and third cistron is under transcriptional control of the 
transcriptional promoter upstream of the first cistron, a sequence is 
provided upstream of each of the second and third cistrons having the 
function of an intemal ribosome entry site (IRES) to effect efficient 
translation of the second and third cistrons on a bi- or tri-cistronic 
messenger RNA transcribed from the beginning of the first cistron through 
each of the second and third cistrons up to the transcriptional terminator 
following the second or third cistron, 

10 

15. The polynucleotide of Claim 14 wherein the IRES is 
selected from encephalomyocarditis virus (EMCV) IRES, swine vesicular 
virus IRES and poliovims IRES. 

15 

16. The polynucleotide of Claim 14 wherein the first cistron 
encodes a human immunodeficiency virus (HIV) REV dependent gene, the 
second cistron encodes REV, and the third cistron encodes a T-cell 
costimulatory element or a cytokine, and further, wherein the first cistron 
is preceded by a transcriptional promoter and the second and third cistrons 
are each preceded by an IRES and no transcriptional promoter. 

17. The polynucleotide of Claim 16 wherein the first cistron 
encodes an HIV gpl60, the first cistron is preceded by cytomegalovirus 
immediate early promoter, the second cistron encodes HIV REV, the 
optional third cistron encodes an interferon, GM-CSF, an interleukin, or a 
B7 protein. 

1 8. A polynucleotide which comprises contiguous nucleic 
acid sequences which cannot replicate in eukaryotic cells but which are 
capable of being expressed to produce a gene product upon introduction of 
the polynucleotide into eukaryotic tissues in vivo, wherein the gene product 
either acts as an immunostimulant or as an antigen capable of generating an 
immune response, wherein the nucleic acid sequences encode: 

a spliced REV gene; 
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a spliced human immunodeficiency virus (HIV) 
immunogenic epitope; and 

optionally, a cytokine or a T-cell recognition element. 

^ 19, The polynucleotide of Claim 18 wherein the HIV 

immunogenic epitope selected from gag, gag-protease, or env or an 
immunogenic subportion thereof; the cytokine is interleukin-12, and the T- 
cell costimulatory element is a B7 protein. 

20. The polynucleotide of Claim 19 wherein the env 
immunogenic epitope is selected from HIV gpl60, HIV gpl20 and HIV 
gp41. 

21. The polynucleotide of Claim 19 wherein the gag 
immunogenic epitope is pi 7, p24, or pi 5. 

22. A polynucleotide comprising a first gene encoding an 
HIV gag, gag-protease, or env immunogenic epitope, the gene containing a 
REV responsive element (RRE) or having been modified to contain an 

, the gene being operatively linked with a transcriptional promoter 
suitable for gene expression in a mammal, the gene being linked with an 
internal ribosome entry site (IRES), and the IRES being linked with a gene 
encoding a REV gene product. 

25 

23. The polynucleotide construct: 

a) VlJns-rgvTTm which has the junction sequence SEQ JD:56: 

5'-GGA GAG AGC GACGAA GAG CTC CTC AAG GCA GTG AGA CTC ATC AAG-3', and 
30 

SEO.ID:57: 

5 -GAT GGC TGG CAA CTA GAA GGC ACA GCA GAT CT/ GAT ATC GCA CTA 
BGH rev... 



TTC TTT AGC TCC TG A CTC CAA TAT TGT-3' 
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b) Vl.lns-gDl60TllH . which has the junction sequence SEO.ID:58: 

5'-CTT AGA TC/ A ACC ATG AGA GTG AAG GA GAA ATA TCA GCA CTT GTG 
^CMVinta gpl60 

GAG ATG GGG GTG GAG ATG GGG CAC CAT GCT CCT TGG GAT GTT GAT GAT CTG 
TAG TGC TAG AGA AAA ATT GTG GGT-3', 

■"■^and SEO.ID:59: 

5'-CTG GCA ACT AGA AGG CAC AGC AGA TC/ A GAT AGT GTC CCC ATC TTA 
BGH gpl60 

TAG CAA AAT CCT TTC CAA GCC CTG TCT TAT TCT-3' 
15 

c) pOEM-3-IRES . which has the junction sequence SEQ.ID:62: 



5'-CAT GCC TGC AGG TCG ACT CTA/ AAT TCC G... 
pGEM-3 (SP6) IRES 

20 

and SEO ID:63: 

5'-A CCC GGG GAT CCT CT/ A GCG CGC TTG TCT CTT GTT CCA... 
pGEM-3 (T7) IRES 

^ ci) pGEM-3-IRES/rgvirTR . which has the junction sequence SEQ.ID:65: 

5'-TAT GGC CAC AAC C/ AT GGC AGG AAG AAG CGG AGA CAG CGA CGA AGA 

IRES rev 

CCT CCT CAA GGC AGT CAG ACT -3' 
30 



and SEO.ID:66: 

5 -CTC GAG CCA TGG GCC CCT/ AGA CTA TAG CGT GAT AAG AAA TCG AGG 
pGEM-3 rev 
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ACT GAG GTT ATA ACA TCC TCT AAG GTG GTT ATA AAC TCC CGA AGG-3' 

e) pGEM-3-RRE/IRES/rgviTiB , which has the junction sequence SEO.ID:68: 
5 -TTG CAT GCC TGC AGG T/ GGT ACA TGA TCA GAT ATC G CCC GGG / C 

^ pGEM-3 RRE 

CGA GAT CTT CAG ACT TGG AGG AGG AGA TAT GAG GGA CAA TTG GAG-3' 
IRES-5' 

^^and SEO.ID:69: 

5 -GGG GCG GAA TT/ T AGA GTC A/ ATT GAT CAG CTT GTG TAA TTG TTA 

RRE-3' 

ATT TCT CTG TCC CAC TCC ATC CAG GTC GTG TGA TTC...-3' 
15 

f) Vl.Ins-(/gr/rgv SD) . which has the junction sequence SEQ.ID:71: 

5'-AGA TCT A AGG ACG GTG ACT GCA / TGT ACT ACT TAC TGC TTT GAT 
CMVintA tat/rev SD 

20 

AGA GGA CGG TGA / CTG CAG AAA AGA CCC ATG GAA A-3' 

CMVintA 

g) Vl.Tns-gpl60 iiiB/IRES/rgviiTR (SD) . which has the junction sequence 
2%EQ.ID:73: 

5'-GGC ACA GCA GAT C/ AG ATG GGG ATC TGA TA TCG CAC TAT TCT TTA 

BGH rev 
GCT CCT GAC TCC TGA CTC-3' 

^^nd SE0.ID:74: 

5'-GGA ATT/ TGA GTC ATC / CCC ATC TTA TAG CAA AAT CCT TTC CAA -3' 
IRES gpl60 

h) VLIns-ggg-^r^iiiB (SD) . which has the junction sequence SEO.ID:75: 
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5'-CTT AGA TC/ C CCG CAC GGC AAG AGG CGA GGG GCG GCG ACT GGT-3' 
CMVintA gag (SD) 



^ 5 -GGC ACA GCA GAT C/ CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 



and SEO.ID:76: 
: ACA GCA 
BGH prt 

TAA TGC TTT TAT TTT TCT TCT GTC...-3" 

VlJns-gag-pr niTB , which has the junction sequence SEQ.ID:??: 

5 -CTT AGA TC/ CAC CAT GGG TGC GAG AGC GTC AGT ATT AA GCG GGG 
CMVintA gag 

^^GA GAA TTA GAT CGA TGG GAA AAA ATT...-3' 
and SEO.ID:78: 

5 -GGC ACA GCA GAT C/ CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 
BGH prt 

20 

TAA TGC TTT TAT TTT TCT TCT GTC...-3' 

j) Vl.Ins-tPA. which has the junction sequence SEO.ID:79: 
5'-TCA CCG TCC TTA GAT C/ ACC ATG GAT GCA ATG AAG AGA GGG CTC TGC 
25 CMVintA tPA leader 

TGT GTG CTG CTG CTG TGT GGA GCA GTC TTC GTT TCG CCC AGC GA/ G ATC 

BGH 

^°rGC TGT GCC TTC TAG TTG CCA GCC-3' 

k) Vl,|ns-tPA-gp1 20 mN , which has the junction sequence SEO.ID:80: 

5 -TTC GTT TCG CCC AGC GA/ TCA CAG AAA AAT TGT GGG TCA CAG TC-3' 
tPA gpl20MN 
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and SEQ.ID:81: 

5'-GGC ACA GCA GAT C/ CAC GTG TTA GCG CTT TTC TCT CTC CAC CAC-3' 
BGH gpl20MN 

5 

1) V1.T-STVMAC251 p28 gag , which has the junction sequence SEO.ID:82: 

5'-TCA CCG TCC TTA GAT CT/ ACC ATG GGA CCA GTA CAA CAA ATA GGT 

CMVintA p28 gag... 

GGT AAC TAT GTC CAC CTG CCA TTA AGC CCG AGA ACA-3' 
10 

and SEQ.ID:83: 

5'-GGC ACA GCA GAT CT/ TTA CAT TAA TCT AGC CTT CTG TCC CGG TCC-3' 
BGH p28 gag 

^ m) Vl.I-SIVMAC251wgr . which has the junction sequence SEQ.ID:84: 
5'-TCA CCG TCC TTA GAT C/ GGT ACA ACC ATG GGT GGA GCT ATT TCC ATG 

CMVintA nef 

AGG CAA TCC AAG CCG GCT GGA GAT CTG ACA GAA A-3' 
20 

and SE0.ID:X5: 

5'-GGC ACA GCA GAT CA/ C CTA GGT TAG CCT TCT TCT AAC CTC TTC CTC 

BGH nef.... 

TGA CAG GCC TGA CTT GCT TCC AAC TCT TCT GGG TAT CTA G-3' 
25 

n) \l.}ns-tat/rev/env : , which has the junction sequence SEQ.ID:86: 
5 - ACC GTC CTT AGA T/ TC GAC ATA GCA GAA TAG GCG TTA CTC GAC AGA 
CMVintA tatlrevlenv 

^ GGA GAG CAA GAA ATG GAG CCA GTA GAT CCT AGA CTA GAG CCC TGG-3' 

and SEQ.ID:87: 

5'-GGC ACA GCA GAT CI C GAG ATG CTG CTC CCA CCC CAT CTG CTG-3'. 
BGH tatlrevlenv 
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24. A polynucleotide which induces anti-HIV neutralizing 
antibody, HIV specific T-cell immune responses, or protective immune 
responses upon introduction into vertebrate tissue, including human tissue 
in vivo- wherein the polynucleotide comprises a gene encoding a gene 
product selected from HIV gag, HIV gag-protease, and HIV env, the gene 
containing a REV responsive element (RRE), the gene being operatively 
linked with a transcriptional promoter suitable for gene expression in a 
mammal, the gene being linked with an internal ribosome entry site (IRES), 
and the IRES being linked with a second gene, the second gene encoding a 
REV gene product. 

25. A method for co-expression in a single cell in vivo , of at 
Jeast two gene products, which comprises introducing between about 1 ng 
and about 100 mg of the polynucleotide of Claim 1 into the tissue of the 
vertebrate. 

26. A method for inducing immune responses in a vertebrate 
against HIV epitopes which comprises introducing between about I ng and 
about 100 mg of the polynucleotide of Claim 6 into the tissue of the 
vertebrate. 

27. A method for inducing immune responses in a vertebrate 
against HIV epitopes which comprises introducing between about 1 ng and 
about 100 mg of the polynucleotide of Claim 14 into the tissue of the 
vertebrate. 

28. A method for using a REV dependent HIV gene to 
jnduce immune responses in vivo which comprises: 

a) isolating the REV dependent HIV gene; 

b) linking the isolated gene to regulatory sequences such 
that the gene is expressible by virtue of being operatively linked to control 
sequences which, when introduced into a living tissue, direct the 
transcription initiation and subsequent translation of the gene; 
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c) introducing the expressible gene into a living tissue; 

and 

d) introducing a gene encoding HIV REV either in trans 
^ or in cis to the HIV REV dependent gene. 

29. The method of Claim 28 which further comprises 
boosting with additional expressible HIV gene, or boosting with a 
recombinant purified HIV gene product, 

10 

30. The method of Claim 28 wherein the REV-dependent 
HIV gene encodes a gag, or an env gene product. 

31. A method for inducing immune responses against 

^ mfection or disease caused by virulent strains of HIV which comprises 
mtroducing into the tissue of a vertebrate an HIV gene from a first HIV 
strain such that an induced immune response neutralizes infection by the 
first HIV strain but also neutralizes infection by strains heterologous to the 
first strain, wherein the HIV gene encodes a conserved, REV dependent 

^ HIV epitope and a functional REV is provided either in cis or in trans. 

32. A vaccine for inducing immune responses against HIV 
infection which comprises the polynucleotide of Claim 1 and a 
pharmaceutically acceptable carrier. 

25 

33. A method for inducing anti-HIV immune responses in a 
primate which comprises introducing the polynucleotide of Claim 1 into the 
tissue of the primate and concurrently administering interleukin 12 
parenterally. 

30 

34. The method of Claim 33 wherein the first cistron of the 
polynucleotide encodes HIV gpl60, the second cistron of the polynucleotide 
encodes HIV REV, and the third cistron of the polynucleotide encodes B7. 



35. 



A polynucleotide comprising: 
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a) an eukaryotic transcriptional promoter; 

b) an open reading frame 3' to the transcriptional 
promoter encoding an immunogenic HIV epitope wherein the open reading 
frame has a splice donor sequence at the 5'-side of the open reading frame, 
a REV responsive element anywhere within the open reading frame, and a 
stop codon encoding the termination of translation of the open reading 
frame; 

c) an intemal ribosome entry site (IRES) 3' to the 
translation stop codon of the open reading frame; 

d) an open reading frame encoding a spliced HIV REV 
gene at the 3' end of which is a translation stop codon; 

e) optionally, 3' to the REV translation stop codon, a 
second IRES, followed by an open reading frame encoding 

^immunomodulatory or immunostimulatory genes, the genes being selected 
from GM-CSF, IL-12, interferon, and a B7 protein; 

f) a transcription-termination signal following the last open 
reading frames. 

36. A method of inducing an antigen-presenting cell to 
stimulate cytotoxic and helper T-cell proliferation effector functions, the 
functions comprising lymphokine secretion specific to HIV antigens, the 
method comprising: 

a) exposing cells of a vertebrate in vivo to a 
^ j)olynucleotide, the polynucleotide comprising sequences encoding an 
antigenic HIV epitope, optionally, HIV REV, and sequences encoding a B7 
protein. 

37. The method of Claim 36 wherein the HIV epitope is 
^ j;elected from env, gag, and pol. 

38. The method of Claim 36 wherein the polynucleotide 
encodes an IRES between each of the HIV epitope, the REV, and the B7 
protein. 
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39. A polynucleotide which comprises sequences encoding: 

a) an eukaryotic transcription initiation signal; 

b) an HIV gene open reading frame (ORF) preceded by 
an heterologous leader sequence such that expression of the HIV gene ORF 
does not depend on availability of the HIV REV gene product; 

c) a sequence which operates as an intemal ribosome 
entry site (IRES) 3' to the translation stop codon of the HIV ORF; 

d) a sequence encoding an ORF of a T-cell costimulatory 
element 3' to the IRES; and 

0 

e) a transcription termination signal 3' to the translation 
stop codon of the T-cell costimulatory element. 

40. The polynucleotide of Claim 39 wherein the HIV gene 
ORF in (b) is tPAgpl20 or tPAgpl60. 

5 

41 . A polynucleotide which comprises sequences encoding: 

a) an eukaryotic transcription initiation signal; 

b) an HIV gene open reading frame (ORF) preceded by 
^an heterologous leader sequence such that expression of the HIV gene ORF 
does not depend on availability of the HIV REV gene product; 

c) a sequence which operates as an intemal ribosome entry site (IRES) 3' to 
the translation stop codon of the HIV ORF; 

d) an HIV gene open reading frame (ORF) preceded by an heterologous 
Reader sequence such that expression of the HIV gene ORF does not depend 
on availability of the HIV REV gene product; and 

e) a transcription termination signal 3' to the translation stop codon of the 
HIV gene ORF. 

^ 42. A composition comprising multiple expression constructs 

each of which is capable of inducing expression in mammalian tissue of 
more than a single cistron encoding antigens related to disease causing 
pathogens or tumors. 
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43. A method for immunization of a host vertebrate 
comprising the step of: 

introducing into direct contact with tissue of the host a non-infectious, non- 

^ integrating polynucleotide encoding at least a first and a second peptide or 
polypeptide, each of which is immunogenic or immunomodulatory when 
produced as a translation products in the host wherein the first peptide or 
polypeptide is encoded by a segment of the polynucleotide which is under 
operative control of a first transcriptional promoter and the second peptide 

^^or polypeptide is encoded by a segment of the polynucleotide under 
operative control of the first transcriptional promoter, in which case no 
transcriptional terminator is provided between the polynucleotide segment 
encoding the first peptide or polypeptide and the segment of polynucleotide 
encoding the second peptide or polypeptide, or the second peptide or 

^j)olypeptide is encoded by a segment of the polynucleotide under operative 
control of a second transcriptional promoter, in which case a transcriptional 
terminator is provided between the segment of polynucleotide encoding the 
first peptide or polypeptide and the segment of polynucleotide encoding the 
second peptide or polypeptide, whereby both of the first and second peptide 

^ or polypeptide are produced within a single cell of the host, resulting in the 
immunization. 



44. A polynucleotide construct having the elements shown in 
figure 2, wherein each of the first, second and third cistrons shown in the 
figure encode a combination of any two to three of the following- 

1) tPA-gpl20MN; 

2) gp 1 60IIIB/IRES/^£V/IIIB; 

3) gpl60lIIB; 

4) REVUIB; 

5) tat/REV/gpl60; 
. 6) REV/gpl60; 

7) gpl60MN; 

8) gpl60 from clinically relevant primary HIV isolates; 

9) nef, using the gene from clinically relevant strains; 

10) Ma^lUB; 
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11) tPA-gpl20iiiB; 

12) gpl60 with structural mutations including V3 loop 
substitutions from clinically relevant strains of HIV; 
several mutations on several constructs such as variable 
loop removal, Asn mutations to remove steric carbohydrate 
obstacles to structural, neutralizing antibody epitopes; 

and CD4 binding site knockout mutants; 

13) gp41 with provision of appropriate leader sequences, 
^ as in the tPA signal peptide leader sequence; 

14) ga^: similar to construct from #5 above, using the 
gene from clinically relevant strains; 

15) rev: for gp 1 60 and ^(7;t? dicistronics; 

16) B7 coding sequences; 
^ 1 7) GM-CSF sequences; 

1 8) Interleukin sequences; 

1 9) Tumor associated antigens; 

20) Genes encoding antigens expressed by pathogens other 
than HIV, such as, but not limited to, influenza vims 

^ nucleoprotein, hemagglutinin, matrix, neuraminidase, and 

other antigenic proteins; herpes simplex vims genes; human 
papillomavirus genes; tuberculosis antigens; hepatitis A, B, 
or C virus antigens; and combinations of these and other 
antigens to form at least dicistronic constructs which may 

^ be combined with multiple other polycistronic constmcts to 

provide a cocktail composition capable of raising immune 
responses against all of the represented pathogens or 
tumor antigens; 

wherein the segments A and B of figure 2 are internal ribosome entry sites 
or a combination of transcription termination sequences terminating the 
transcription of the upstream cistron and transcriptional promoter 
sequences, initiating the transcription of downstream cistron. 
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1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 

51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 

101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 

151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA 

201 CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG CTATTGGCCA 

251 TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 

301 TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT 

351 AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 

401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 

451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 

501 CTTTCCATTG ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG 

551 GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 

601 TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG 

651 ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 

701 GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 

751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 

801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 

851 TTGACGCAAA TGGGCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAG 

901 AGCTCGTTTA GTGAACCGTC AGATCGCCTG GAGACGCCAT CCACGCTGTT 

951 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 

FIG.6A 

SUBSTITUTE SHEET (RULE 26) 



7/29 



1001 CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTGACG TAAGTACCGC 

1051 CTATAGAGTC TATAGGCCCA CCCCCTTGGC TTCTTATGCA TGCTATACTG 

1101 TTTTTGGCTT GGGGTCTATA CACCCCCGCT TCCTCATGTT ATAGGTGATG 

1151 GTATAGCTTA GCCTATAGGT GTGGGTTATT GACCATTATT GACCACTCCC 

1201 CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACATG GCTCTTTGCC 

1251 ACAACTCTCT TTATTGGCTA TATGCCAATA CACTGTCCTT CAGAGACTGA 

1301 CACGGACTCT GTATTTTTAC AGGATGGGGT CTCATTTATT ATTTACAAAT 

1351 TCACATATAC AACACCACCG TCCCCAGTGC CCGCAGTTTT TATTAAACAT 

1401 AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 

1451 TTCTCCGGTA GCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 

1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA 

1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 

1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCGGGGAGC GGGCTTGCAC 

1651 CGCTGACGCA TTTGGAAGAC TTAAGGCAGC GGCAGAAGAA GATGCAGGCA 

1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTGCGGTG 

1751 CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 

1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 

1851 TGGGTCTTTT CTGCAGTCAC CGTCCTTAG ATCTGCTGTG CCTTCTAGTT 

1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 

1951 GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 

FIG.6B 
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2001 TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGCACA 

2051 GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GGATGCGGTG 

2101 GGCTCTATGG GTACCCAGGT GCTGAAGAAT TGACCCGGTT CCTCCTGGGC 

2151 CAGAAAGAAG CAGGCACATC CCCTTCTCTG TGACACACCC TGTCCACGCC 

2201 CCTGGTTCTT AGTTCCAGCC CCACTCATAG GACACTCATA GCTCAGGAGG 

2251 GCTCCGCCTT CAATCCCACC CGCTAAAGTA CTTGGAGCGG TCTCTCCCTC 

2301 CCTCATCAGC CCACCAAACC AAACCTAGCC TCCAAGAGTG GGAAGAAATT 

2351 AAAGCAAGAT AGGCTATTAA GTGCAGAGGG AGAGAAAATG CCTCCAACAT 

2401 GTGAGGAAGT AATGAGAGAA ATCATAGAAT TTCTTCCGCT TCCTCGCTCA 

2451 CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC 

2501 TCAAAGGCGG TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA 

2551 GAACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG 

2601 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 

2651 AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 

2701 CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 

2751 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 

2801 CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 

2851 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG 

2901 CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 

2951 TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 

FIG.6C 
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3001 AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 
3051 GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 
3101 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
3151 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC 
3201 AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGAGGCTCA GTGGAACGAA 
3251 AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 
3301 CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT 
3351 ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 
3401 ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCCCGT 
3451 CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG 
3501 CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA 
3551 AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC 
3601 CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT 
3651 CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG 
3701 GTGTCACGCT CGTCGTTTGG TATGGCTTCA TTCAGCTCCG GTTCCCAACG 
3751 ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTAGCT 
3801 CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 
3851 CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 
3901 AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT 
3951 AGTGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT 

FIG.eD 
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4001 ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG CTCATCATTG GAAAACGTTC 

4051 TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA TCCAGTTCGA 

4101 TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 

4151 AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG 

4201 AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT 

4251 ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 

4301 GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG 

4351 AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT 

4401 ATAAAAATAG GCGTATCACG AGGCCCTTTC GTC 



FIG.eE 
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1001 CGGTGCATTG GAACGCGGAT TCCCCXJTGCC AAGAGTGACG TAAGTACCGC 

1051 CTATAGAGTC TATAGGCCCA CCCCCTTGGC TTCTTATGCA TGCTATACTG 

1101 TTTTTGGCTT GGGGTCTATA CACCCCCGCT TCCTCATGTT ATAGGTGATG 

1 151 GTATAGCTTA GCCTATAGGT GTGGGTTATT GACCATTATT GACCACTCCC 

1201 CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACATG GCTCTTTGCC 

1251 ACAACTCTCT TTATTGGCTA TATGCCAATA CACTGTCCTT CAGAGACTGA 

1301 CAGGGACTCT GTATTTTTAC AGGATGGGGT CTCATTTATT ATTTACAAAT 

1351 TCACATATAC AACACCACCG TCCCCAGTGC CCGCAGTTTT TATTAAACAT 

1401 AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 

1451 TTCTCCGGTA GCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 

1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA 

1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 

1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCGGGGAGC GGGCTTGCAC 

1651 CGCTGACGCA TTTGGAAGAC TTAAGGCAGC GGCAGAAGAA GATGCAGGCA 

1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTGCGGTG 

1751 CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 

1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 

1851 TGGGTCTTTT CTGCAGTCAC CGTCCTTAG ATCTGCTGTG CCTTCTAGTT 

1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 

1951 GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 

2001 TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGCACA 

FIG.7B 
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2051 


GCAAGGGGGA 
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TGAAGCCAGT 


TACCTTCGGA 
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3101 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
3151 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC 
3201 AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA 
3251 AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 
3301 CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT 
3351 ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 
3401 ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCGGGG 
3451 GGGGGGGGCG CTGAGGTCTG CCTCGTGAAG AAGGTGTTGC TGACTCATAC 
3501 CAGGCCTGAA TCGCCCCATC ATCCAGCCAG AAAGTGAGGG AGCCACGGTT 
3551 GATGAGAGCT TTGTTGTAGG TGGACCAGTT GGTGATTTTG AACTTTTGCT 
3601 TTGCCACGGA ACGGTCTGCG TTGTCGGGAA GATGCGTGAT CTGATCCTTC 
3651 AACTCAGCAA AAGTTCGATT TATTCAACAA AGCCGCCGTC CCGTCAAGTC 
3701 AGCGTAATGC TCTGCCAGTG TTACAACCAA TTAACCAATT CTGATTAGAA 
3751 AAACTCATCG AGCATCAAAT GAAACTGCAA TTTATTCATA TCAGGATTAT 
3801 CAATACCATA TTTTTGAAAA AGCCGTTTCT GTAATGAAGG AGAAAACTCA 
3851 CCGAGGCAGT TCCATAGGAT GGCAAGATCC TGGTATCGGT CTGCGATTCC 
3901 GACTCGTCCA ACATCAATAG AACCTATTAA TTTCCCCTCG TCAAAAATAA 
3951 GGTTATCAAG TGAGAAATCA CCATGAGTGA CGACTGAATC CGGTGAGAAT 
4001 GGCAAAAGCT TATGCATTTC TTTCCAGACT TGTTCAACAG GCCAGCCATT 
4051 ACGCTCGTCA TCAAAATCAC TCGCATCAAC CAAACCGTTA TTCATTCGTG 
4101 ATTGCGCCTG AGCGAGACGA AATACGCGAT CGCTGTTAAA AGGACAATTA 
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■ 4151 CAAACAGGAA TCGAATGCAA CCGGCGCAGG AACACTGCCA GCGCATCAAC 
4201 AATATTTTCA CCTGAATCAG GATATTCTTC TAATACCTGG AATGCTGTTT 
4251 TCCCGGGGAT CGCAGTGGTG AGTAACCATG CATCATCAGG AGTACGGATA 
4301 AAATGCTTGA TGGTCGGAAG AGGCATAAAT TCCGTCAGCC AGTTTAGTCT 
4351 GACCATCTCA TCTGTAACAT CATTGGCAAC GCTACCTTTG CCATGTTTCA 
4401 GAAACAACTC TGGCGCATCG GGCTTCCCAT ACAATCGATA GATTGTCGCA 
4451 CCTGATTGCC CGACATTATC GCGAGCCCAT TTATACCCAT ATAAATCAGC 
4501 ATCCATGTTG GAATTTAATC GCGGCCTCGA GCAAGACGTT TCCCGTTGAA 
4551 TATGGCTCAT AACACCCCTT GTATTACTGT TTATGTAAGC AGACAGTTTT 
4601 ATTGTTCATG ATGATATATT TTTATCTTGT GCAATGTAAC ATCAGAGATT 
4651 TTGAGACACA ACGTGGCTTT CCCCCCCCCC CCATTATTGA AGCATTTATC 
4701 AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT 
4751 AAACAAATAG GGGTTCCXJCG CACATTTCCC CGAAAAGTGC CACCTGACGT 
4801 CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT AGGGGTATCA 
4851 CGAGGCCCTT TCGTC 
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1 ATTGGCTATT GGCCATTGCA TACGTTGTAT CCATATCATA ATATGTACAT 

51 TTATATTGGC TCATGTCCAA CATTACCGCC ATGTTGACAT TGATTATTGA 

101 CTAGTTATTA ATAGTAATCA ATTACGGGGT CATTAGTTCA TAGCCCATAT 

151 ATGGAGTTCC GCGTTACATA ACTTACGGTA AATGGCCCGC CTGGCTGACC 

201 GCCCAACGAC CCCCGCCCAT TGACGTCAAT AATGACGTAT GTTCCCATAG 

251 TAACGCCAAT AGGGACTTTC CATTGACGTC AATGGGTGGA GTATTTACGG 

301 TAAACTGCCC ACTTGGCAGT ACATCAAGTG TATCATATGC CAAGTACGCC 

351 CCCTATTGAG GTCAATGACG GTAAATGGCC CGCCTGGCAT TATGCCCAGT 

401 ACATGACCTT ATGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC 

451 ATCGCTATTA CCATGGTGAT GCGGTTTTGG CAGTACATCA ATGGGCGTGG 

501 ATAGCGGTTT GACTCACGGG GATTTCCAAG TCTCCACCCC ATTGACGTCA 

551 ATGGGAGTTT GTTTTGGCAC CAAAATCAAC GGGACTTTCC AAAATGTCGT 

601 AACAACTCCG CCCCATTGAC GCAAATGGGC GGTAGGCGTG TACGGTGGGA 

651 GGTCTATATA AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC 

701 GCCATCCACG CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC 

751 CTCCGCGGCC GGGAACGGTG CATTGGAACG CGGATTCCCC GTGCCAAGAG 

801 TGACGTAAGT ACCGCCTATA GAGTCTATAG GCCCACCCCC TTGGCTTCTT 

851 ATGCATGCTA TACTGTTTTT GGCTTGGGGT CTATACACCC CCGCTTCCTC 

901 ATGTTATAGG TGATGGTATA GCTTAGCCTA TAGGTGTGGG TTATTGACCA 

951 TTATTGACCA CTCCCCTATT GGTGACGATA CTTTCCATTA CTAATCCATA 

1001 ACATGGCTCT TTGCCACAAC TCTCTTTATT GGCTATATGC CAATACACTG 
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1051 TCCTTCAGAG ACTGACACGG ACTCTGTATT TTTACAGGAT GGGGTCTCAT 
1101 TTATTATTTA CAAATTCACA TATACAACAC CACCGTCCCC AGTGCCCGCA 
1151 GTTTTTATTA AACATAACGT GGGATCTCCA CGCGAATCTC GGGTACGTGT 
1201 TCCGGACATG GGCTCTTCTC CGGTAGCGGC GGAGCTTCTA CATCCGAGCC 
1251 CTGCTCCCAT GCCTCCAGCG ACTCATGGTC GCTCGGCAGC TCCTTGCTCC 
1301 TAACAGTGGA GGCCAGACTT AGGCACAGCA CGATGCCCAC CACCACCAGT 
1351 GTGCCGCACA AGGCCGTGGC GGTAGGGTAT GTGTCTGAAA ATGAGCTCGG 
1401 GGAGCGGGCT TGCACCGCTG ACGCATTTGG AAGACTTAAG GCAGCGGCAG 
1451 AAGAAGATGC AGGCAGCTGA GTTGTTGTGT TCTGATAAGA GTCAGAGGTA 
1501 ACTCCCGTTG CGGTGCTGTT AACGGTGGAG GGCAGTGTAG TCTGAGCAGT 
1551 ACTCGTTGCT GCCGCGCGCG CCACCAGACA TAATAGCTGA CAGACTAACA 
1601 GACTGTTCCT TTCCATGGGT CTTTTCTGCA GTCACCGTCC TTAGATCTG 
1651 CTGTGCCTTC TAGTTGCCAG CCATCTGTTG TTTGCCCCTC CCCCGTGCCT 
1701 TCCTTGACCC TGGAAGGTGC CACTCCCACT GTCCTTTCCT AATAAAATGA 
1751 GGAAATTGCA TCGCATTGTC TGAGTAGGTG TCATTCTATT CTGGGGGGTG 
1801 GGGTGGGGCA GCACAGCAAG GGGGAGGATT GGGAAGACAA TAGCAGGCAT 
1851 GCTGGGGATG CGGTGGGCTC TATGGGTACC CAGGTGCTGA AGAATTGACC 
1901 CGGTTCCTCC TGGGCCAGAA AGAAGCAGGC ACATCCCCTT CTCTGTGACA 
1951 CACCCTGTCC ACGCCCCTGG TTCTTAGTTC CAGCCCCACT CATAGGACAC 
2001 TCATAGCTCA GGAGGGCTCC GCCTTCAATC CCACCCGCTA AAGTACTTGG 
2051 AGCGGTCTCT CCCTCCCTCA TCAGCCCACC AAACCAAACC TAGCCTCCAA 
2101 GAGTGGGAAG AAATTAAAGC AAGATAGGCT ATTAAGTGCA GAGGGAGAGA 
2151 AAATGCCTCC AACATGTGAG GAAGTAATGA GAGAAATCAT AGAATTC 
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GATATTGG CTATTGGCCA 


251 


TTG.CATACGT 


TGTATCCATA 


301 


TCCAACATTA 
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351 


AATCAATTAC 


GGGGTCATTA 


401 


ACATAACTTA 


CGGTAAATGG 


451 


CCCATTGACG 


TCAATAATGA 


501 


CTTTCCATTG 


ACGTCAATGG 


551 


GCAGTACATC 


AAGTGTATCA 


601 


TGACGGTAAA 


TGGCCCGCCT 


651 


ACTTTCCTAC 


TTGGCAGTAC 


701 


GTGATGCGGT 


TTTGGCAGTA 


751 


ACGGGGATTT 


CCAAGTCTCC 
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GGCACCAAAA 


TCAACGGGAC 


851 
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TGGGCGGTAG 
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AGCTCGTTTA 


GTGAACCGTC 
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TTGACCTCCA 


TAGAAGACAC 
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GAACGCGGAT 
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1101 
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GGGGTCTATA 
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GTATAGCTTA 


GCCTATAGGT 


1201 


CTATTGGTGA 


CGATACTTTC 
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CGTCAATGGG 


AGTTTGTTTT 


TTTCCAAAAT 


GTCGTAACAA 


CTCCGCCCCA 


GCGTGTACGG 


TGGGAGGTCT 


ATATAAGCAG 


AGATCGCCTG 


GAGACGCCAT 


CCACGCTGTT 


CGGGACCGAT 


CCAGCCTCCG 


CGGCCGGGAA 








CCCCCTTGGC 
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GACCATTATT 
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CCATAACATG 


GCTCTTTGCC 
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1251 ACAACTCTCT TTATTGGCTA TATGCCAATA CACTGTCCTT CAGAGACTGA 
1301 CACGGACTCT GTATTTTTAC AGGATGGGGT CTCATTTATT ATTTACAAAT 
1351 TCACATATAC AACACCACCG TCCCCAGTGC CCGCAGTTTT TATTAAACAT 
1401 AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 
1451 TTCTCCGGTA GCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 
1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA 
1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 
1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCGGGGAGC GGGCTTGCAC 
1651 CGCTGACGCA TTTGGAAGAC TTAAGGCAGC GGCAGAAGAA GATGCAGGCA 
1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTGCGGTG 
1751 CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 
1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 
1851 TGGGTCTTTT CTGCAGTCAC CGTCCTTAG ATCTGCTGTG CCTTCTAGTT 
1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 
1951 GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 
2001 TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGCACA 
2051 GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GGATGCGGTG 
2101 GGCTCTATGG GTAC GGCCGCAGCGGCC GTACCCAGGT GCTGAAGAAT 
TGACCCGGTT CCTCGACCCGT AAAAAGGCCG 
2601 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 
2651 AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 
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2701 CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 

2751 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 

2801 CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 

2851 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG 

2901 CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 

2951 TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 

3001 AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 

3051 GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 

3101 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG 

3151 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC 

3201 AAGAAGATCC TTTGATCTTT TCTACGTGATCC CGTAATGC TCTGCCAGTG 

TTACAACCAA TTAACCAATT CTGATTAGAA 

3751 AAACTCATCG AGCATCAAAT GAAACTGCAA TTTATTCATA TCAGGATTAT 

3801 CAATACCATA TTTTTGAAAA AGCCGTTTCT GTAATGAAGG AGAAAACTCA 

3851 CCGAGGCAGT TCCATAGGAT GGCAAGATCC TGGTATCGGT CTGCGATTCC 

3901 GACTCGTCCA ACATCAATAC AACCTATTAA TTTCCCCTCG TCAAAAATAA 

3951 GGTTATCAAG TGAGAAATCA CCATGAGTGA CGACTGAATC CGGTGAGAAT 

4001 GGCAAAAGCT TATGCATTTC TTTCCAGACT TGTTCAACAG GCCAGGCATT 
4051 ACGCTCGTCA TCAAAATCAC TCGCATCAAC CAAACCGTTA TTCATTCGTG 
4101 ATTGCGCCtG AGCGAGACGA AATACGCGAT CGCTGTTAAA AGGACAATTA 
4151 CAAACAGGAA TCGAATGCAA CCGGCGCAGG AACACTGCCA GCGCATCAAC 
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4201 AATATTTTCA CCTGAATCAG GATATTCTTC TAATACCTGG AATGCTGTTT 

4251 TCCCGGGGAT CGCAGTGGTG AGTAACCATG CATCATCAGG AGTACGGATA 

4301 AAATGCTTGA TGGTCGGAAG AGGCATAAAT TCCGTCAGCC AGTTTAGTCT 

4351 GACCATCTCA TCTGTAACAT CATTGGCAAC GCTACCTTTG CCATGTTTCA 

4401 GAAACAACTC TGGCGCATCG GGCTTCCCAT ACAATCGATA GATTGTCGCA 

4451 CCTGATTGCC CGACATTATC GCGAGCCCAT TTATACCCAT ATAAATCAGC 

4501 ATCCATGTTG GAATTTAATC GCGGCCTCGA GCAAGACGTT TCCCGTTGAA 

4551 TATGGCTCAT AACACCCCTT GTATTACTGT TTATGTAAGC AGACAGTTTT 

4601 ATTGTTCATG ATGATATATT TTTATCTTGT GCAATGTAAC ATCAGAGATT 

4651 TTGAGACACA ACGTGGCTTT CO 
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